Abstract. An optimal B-robust estimate is constructed for a multidimensional parameter in the drift coefficient of a diffusion-type process with small noise. An optimal mean-variance robust (optimal V-robust) trading strategy is found to hedge, in the mean-variance sense, a contingent claim in an incomplete financial market with arbitrary information structure and misspecified volatility of the asset price, which is modelled by a multidimensional continuous semimartingale. The results obtained are applied to a stochastic volatility model in which the model of the latent volatility process contains an unknown multidimensional parameter in the drift coefficient and a small parameter in the diffusion term.

1. Introduction, Motivation and Results

The hedging and pricing of contingent claims in incomplete financial markets and dynamic portfolio selection problems are important issues in the modern theory of finance. These problems are linked through the so-called mean-variance approach.

To determine a "good" hedging strategy in an incomplete market with arbitrary information structure $F = (\mathcal{F}_t)_{0 \le t \le T}$, one riskless asset and $d$, $d \ge 1$, risky assets whose price process is a semimartingale $X$, the mean-variance approach suggests using the quadratic criterion to measure the hedging error, i.e. solving the mean-variance hedging problem introduced by Föllmer and Sondermann [10]:

\[ \text{minimize } E\Big| H - x - \int_0^T \theta_t\, dX_t \Big|^2 \text{ over all } \theta \in \Theta, \tag{1.1} \]

where the contingent claim $H$ is an $\mathcal{F}_T$-measurable square-integrable random variable (r.v.), $x$ is an initial investment, $\Theta$ is a class of admissible trading strategies, and $T$ is an investment horizon.

The mean-variance formulation of Markowitz [26] provides a foundation for single-period portfolio selection (see also Merton [27]). In a recent paper, Li and Ng [22] extended Markowitz's mean-variance formulation to multiperiod portfolio selection, finding the optimal portfolio policy and determining the efficient frontier in analytical form.

As pointed out in Li and Ng [22], the results on the multiperiod mean-variance formulation with one riskless asset can be derived from the results on the mean-variance hedging formulation.

Therefore, mean-variance hedging is a powerful approach to both of the above-mentioned major problems.

The problem (1.1) has been intensively investigated in the last decade (see, e.g., Duffie and Richardson [9], Schweizer [35], [37], [55], Delbaen et al. [8], Monat and Stricker [28], Rheinländer and Schweizer [33] (RSch hereafter), Pham et al. [31], Gourieroux et al. [11] (GLP hereafter), Laurent and Pham [18]).

A stochastic volatility model, proposed by Hull and White [13] and Scott [59], in which the stock price volatility is a random process, is a popular model of an incomplete market where the mean-variance hedging approach can be used (see, e.g., Laurent and Pham [18], Biagini et al. [13], Mania and Tevzadze [2lj, Pham et al. [31]).

Consider the stochastic volatility model described by the following system of SDEs:

\[
\begin{aligned}
dX_t &= X_t\, dR_t, \quad X_0 > 0,\\
dR_t &= \mu_t(R_t, Y_t)\, dt + \sigma_t\, dw_t^R, \quad R_0 = 0,\\
\sigma_t^2 &= f(Y_t),\\
dY_t &= a(t, Y_t; \alpha)\, dt + \varepsilon\, dw_t^\sigma, \quad Y_0 = 0,
\end{aligned} \tag{1.2}
\]

where $w = (w^R, w^\sigma)$ is a standard two-dimensional Wiener process defined on a complete probability space $(\Omega, \mathcal{F}, P)$, $F^w = (\mathcal{F}_t^w)_{0 \le t \le T}$ is the $P$-augmentation of the natural filtration $\mathcal{F}_t^w = \sigma(w_s, 0 \le s \le t)$, $0 \le t \le T$, generated by $w$, $f(\cdot)$ is a continuous one-to-one positive locally bounded function (e.g., $f(x) = e^x$), $\alpha = (\alpha_1, \dots, \alpha_m)$, $m \ge 1$, is a vector of unknown parameters, and $\varepsilon$, $0 < \varepsilon \ll 1$, is a small number. Assume that the system (1.2) has a unique strong solution.
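As an illustration, the stochastic volatility system above can be simulated by an Euler-Maruyama scheme. The following is only a sketch under assumptions that are not part of the model: $f(y) = e^y$, a hypothetical mean-reverting drift $a(t, y; \alpha) = \alpha_1 - \alpha_2 y$, zero appreciation rate $\mu \equiv 0$, and zero initial values for $R$ and $Y$. The function name `simulate_sv` is ours.

```python
import math
import random


def simulate_sv(alpha, eps, T=1.0, n=1000, x0=1.0, seed=0):
    """Euler-Maruyama sketch of the stochastic volatility system.

    Illustrative assumptions (not taken from the paper):
    f(y) = exp(y), a(t, y; alpha) = alpha[0] - alpha[1] * y, mu = 0.
    """
    rng = random.Random(seed)
    dt = T / n
    x, r, y = x0, 0.0, 0.0                      # X_0 > 0, R_0 = 0, Y_0 = 0
    xs, rs, ys = [x], [r], [y]
    for _ in range(n):
        sigma = math.sqrt(math.exp(y))          # sigma_t^2 = f(Y_t)
        dw_r = rng.gauss(0.0, math.sqrt(dt))    # increment of w^R
        dw_s = rng.gauss(0.0, math.sqrt(dt))    # increment of w^sigma
        dr = sigma * dw_r                       # dR_t with mu = 0
        x += x * dr                             # dX_t = X_t dR_t
        r += dr
        y += (alpha[0] - alpha[1] * y) * dt + eps * dw_s
        xs.append(x)
        rs.append(r)
        ys.append(y)
    return xs, rs, ys
```

With `eps = 0` the volatility path reduces to the deterministic Euler recursion for the unperturbed equation, which gives a quick sanity check of the scheme.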

This model is analogous to the model proposed by Renault and Touzi [32] (RT hereafter). The principal difference is the presence of the small parameter $\varepsilon$ in our model, which reflects the assumption that the volatility of the randomly fluctuating volatility process is small (see also Sircar and Papanicolaou [40]). This assumption enables us to use the prices of traded options with short maturities, nearest to the current time value, for volatility process filtration and parameter estimation purposes (see below). In contrast, RT [32] need to assume that there exist traded derivatives with arbitrarily long maturities (up to infinity).

An important feature of stochastic volatility models is that the volatility process $Y$ is an unobservable (latent) process. To obtain an explicit form of the optimal trading strategy, full knowledge of the model of the process $Y$ is necessary, and hence one needs to estimate the unknown parameter $\alpha = (\alpha_1, \dots, \alpha_m)$, $m \ge 1$.

A variety of estimation procedures are used, which involve either direct statistical analysis of historical data or the use of implied volatilities extracted from the prices of existing traded derivatives.

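For example, one can use the following method based on historical data.

Fix the time variable $t$. From observations $X_{t_0^{(n)}}, \dots, X_{t_n^{(n)}}$, $0 = t_0^{(n)} < \dots < t_n^{(n)} = t$, $\max_j \big(t_{j+1}^{(n)} - t_j^{(n)}\big) \to 0$ as $n \to \infty$, calculate the realization of the yield process $R_t = \int_0^t \frac{dX_s}{X_s}$ and then calculate the sum

\[ S_n(t) = \sum_{j=0}^{n-1} \big| R_{t_{j+1}^{(n)}} - R_{t_j^{(n)}} \big|^2. \]

It is well known (see, e.g., Liptser and Shiryaev [23]) that

\[ S_n(t) \xrightarrow{P} \int_0^t \sigma_s^2\, ds \quad \text{as } n \to \infty. \]

Since $\sigma_t^2(\omega) = f(Y_t)$ is a continuous process, we get

\[ \sigma_t^2(\omega) = \lim_{\Delta \downarrow 0} \frac{F(t + \Delta, \omega) - F(t, \omega)}{\Delta}, \]

where $F(t, \omega) = \int_0^t \sigma_s^2(\omega)\, ds$.

Hence the realization $(y_t)_{0 \le t \le T}$ of the process $Y$ can be found by the formula $y_t = f^{-1}(\sigma_t^2)$, $0 \le t \le T$.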
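The historical-data procedure can be sketched numerically as follows. This is a minimal illustration, not the authors' implementation: it assumes $f(y) = e^y$ (so $f^{-1} = \ln$), replaces the forward difference quotient by a backward one over the last `window` increments (only data up to $t$ are available), and uses a hypothetical helper `simulate_yield` with constant volatility to produce test data.

```python
import math
import random


def realized_variance(r):
    """S_n(t): sum of squared increments of the yield process R on a grid."""
    return sum((r[j + 1] - r[j]) ** 2 for j in range(len(r) - 1))


def spot_variance(r, dt, window):
    """Backward difference quotient (F(t) - F(t - window*dt)) / (window*dt),
    with F approximated by the realized variance of the last `window` increments."""
    tail = r[-(window + 1):]
    return realized_variance(tail) / (window * dt)


def reconstruct_y(sigma2):
    """y_t = f^{-1}(sigma_t^2) for the illustrative choice f(y) = exp(y)."""
    return math.log(sigma2)


def simulate_yield(sigma, t, n, seed=0):
    """Hypothetical test data: R_t = sigma * w_t with constant sigma."""
    rng = random.Random(seed)
    dt = t / n
    r = [0.0]
    for _ in range(n):
        r.append(r[-1] + sigma * rng.gauss(0.0, math.sqrt(dt)))
    return r
```

For a yield path generated with constant volatility $\sigma$, `realized_variance` should be close to $\sigma^2 t$ on a fine grid, and `reconstruct_y(spot_variance(...))` gives a noisy estimate of $\ln \sigma^2$.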

More sophisticated methods using the same idea can be found, e.g., in Chesney et al. [5], Pastorello [30].

We can use the reconstructed sample path $(y_t)$, $0 \le t \le T$, to estimate the unknown parameter $\alpha$ in the drift coefficient of the diffusion process $Y$.

The second, market-price-adjusted procedure for reconstructing the sample path of the volatility process $Y$ and estimating the parameter was suggested by RT [32], who used implied volatility data.

We present a quick review of this method, adapted to our model (1.2).

Suppose that the volatility risk premium $\lambda^\sigma = 0$, meaning that the risk from the volatility process is non-compensated (or can be diversified away). Then the price $C_t(\sigma)$ of a European call option can be calculated by the Hull and White formula (see, e.g., RT [32]), and the Black-Scholes (BS) implied volatility $\sigma^i(\alpha)$ can be found as the unique solution of the equation

\[ C_t(\sigma) = C_t^{BS}(\sigma^i(\alpha)), \]

where $C^{BS}(\sigma)$ denotes the standard BS formula written as a function of the volatility parameter $\sigma$.
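Inverting the BS formula in the volatility argument is a one-dimensional root-finding problem, since the BS call price is increasing in volatility. The sketch below uses plain bisection; the function names, the bracketing interval `[lo, hi]` and the zero default interest rate are our illustrative assumptions, and the observed price is taken as given (the Hull and White price itself is not computed here).

```python
import math


def norm_cdf(x):
    """Standard normal distribution function via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bs_call(s, k, tau, sigma, r=0.0):
    """Standard Black-Scholes call price as a function of the volatility sigma."""
    d1 = (math.log(s / k) + (r + 0.5 * sigma ** 2) * tau) / (sigma * math.sqrt(tau))
    d2 = d1 - sigma * math.sqrt(tau)
    return s * norm_cdf(d1) - k * math.exp(-r * tau) * norm_cdf(d2)


def implied_vol(price, s, k, tau, r=0.0, lo=1e-6, hi=5.0, tol=1e-10):
    """Bisection for the BS implied volatility; relies on the monotonicity
    of the BS price in sigma and on the root lying in [lo, hi]."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if bs_call(s, k, tau, mid, r) < price:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)
```

A round trip such as `implied_vol(bs_call(100, 100, 0.25, 0.3), 100, 100, 0.25)` should recover the input volatility up to the tolerance.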

Here (for further estimation purposes) only at-the-money options are used.

Under some technical assumptions (see Proposition 5.1 of RT [32], and Bajeux and Rochet [23] for a general diffusion volatility process)

\[ \frac{\partial \sigma_t^i}{\partial \sigma_t}(\sigma_t, \alpha) > 0 \tag{1.3} \]

(recall that the drift coefficient of the process $Y$ depends on the unknown parameter $\alpha$).

Fix the current value of the time parameter $t$, $0 < t < T$, and let $0 < T_1 < T_2 < \dots < T_{k-1} < t < T_k$ be the maturity times of some traded at-the-money options.

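Let $\sigma^{i*}_{t_j^\varepsilon}$ be the observations of the implied volatility at the time moments $0 = t_0^\varepsilon < t_1^\varepsilon < \dots < t^\varepsilon_{[t/\varepsilon]} = t$, $\max_j \big[t_{j+1}^\varepsilon - t_j^\varepsilon\big] \to 0$ as $\varepsilon \to 0$.

Then, using (1.3) and solving the equation

\[ \sigma^i\big(\sigma_{t_j^\varepsilon}, \alpha\big) = \sigma^{i*}_{t_j^\varepsilon}, \]

one can obtain the realization $\{\tilde\sigma_{t_j^\varepsilon}\}$ of the volatility $(\sigma_t)$ and thus, using the formula $y_{t_j^\varepsilon} = f^{-1}(\tilde\sigma^2_{t_j^\varepsilon})$, the realization $\{y_{t_j^\varepsilon}\}$ of the volatility process $(Y_t)$, which can be viewed as the realization of a nonlinear AR(1) process:

\[ Y_{t_{j+1}^\varepsilon} - Y_{t_j^\varepsilon} = a\big(t_j^\varepsilon, Y_{t_j^\varepsilon}; \alpha\big)\big(t_{j+1}^\varepsilon - t_j^\varepsilon\big) + \varepsilon\big(w^\sigma_{t_{j+1}^\varepsilon} - w^\sigma_{t_j^\varepsilon}\big). \]

Using the data $\{y_{t_j^\varepsilon}\}$ one can construct the MLE $\hat\alpha_t^\varepsilon$ of the parameter $\alpha$; see, e.g., Chitashvili et al. [25], [26], Lazrieva and Toronjadze [19].

Recall the scheme of construction of the MLE. Rewrite the previous AR(1) process, using obvious simplified notation, in the form

\[ Y_{j+1} - Y_j = a(t_j, Y_j; \alpha)\Delta + \varepsilon \Delta w_j. \]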

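Then

\[ \frac{\partial}{\partial y} P\{Y_{j+1} \le y \mid Y_j\} = \frac{1}{\sqrt{2\pi\Delta}\,\varepsilon} \exp\Big\{ -\frac{(y - Y_j - a(t_j, Y_j; \alpha)\Delta)^2}{2\varepsilon^2\Delta} \Big\} \]

and the log-derivative of the likelihood $\ell = (\ell^{(1)}, \dots, \ell^{(m)})$ is given by the relation

\[ \ell^{(i)}(y, Y_j; \alpha) = \frac{1}{\varepsilon^2\Delta}\,\big(y - Y_j - a(t_j, Y_j; \alpha)\Delta\big)\,\dot a^{(i)}(t_j, Y_j; \alpha)\,\Delta, \quad i = 1, \dots, m, \]

where $\dot a^{(i)} = \partial a / \partial \alpha_i$.

Hence the MLE is a solution (under some conditions) of the system of equations

\[ \frac{1}{\varepsilon^2\Delta} \sum_j \big(y_{j+1} - y_j - a(t_j, y_j; \alpha)\Delta\big)\,\dot a^{(i)}(t_j, y_j; \alpha)\,\Delta = 0, \quad i = 1, \dots, m \]

(where the reconstructed data $\{y_j\} = \{y_{t_j^\varepsilon}\}$ are substituted).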
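When the drift is linear in the parameter, the score equations of the discretized model are linear in $\alpha$ and can be solved in closed form. The sketch below does this for the hypothetical drift $a(t, y; \alpha) = \alpha_1 - \alpha_2 y$ (our assumption, chosen only for illustration) on an equidistant grid with step `dt`; note that the factor $1/\varepsilon^2$ cancels from the equations.

```python
def mle_linear_drift(y, dt):
    """Solve the score equations
        sum_j (y[j+1] - y[j] - a_j * dt) * grad_a_j = 0
    for the assumed drift a(t, y; alpha) = alpha1 - alpha2 * y,
    whose gradient in alpha is (1, -y). The system is linear in alpha."""
    n = len(y) - 1
    sy = sum(y[j] for j in range(n))
    syy = sum(y[j] ** 2 for j in range(n))
    sd = sum(y[j + 1] - y[j] for j in range(n))
    sdy = sum((y[j + 1] - y[j]) * y[j] for j in range(n))
    # Normal equations:
    #   sd  = dt * (alpha1 * n  - alpha2 * sy)
    #   sdy = dt * (alpha1 * sy - alpha2 * syy)
    det = sy * sy - n * syy          # nonzero unless the path is constant
    alpha1 = (sy * sdy - syy * sd) / (dt * det)
    alpha2 = (n * sdy - sy * sd) / (dt * det)
    return alpha1, alpha2
```

On noise-free data generated by the same Euler recursion the estimate reproduces the generating parameters, which is a convenient check before applying it to a reconstructed volatility path.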
Following RT [32], let us introduce the functionals $HW$ (corresponding to the Hull and White formula) and

\[ \varphi_\varepsilon = \mathrm{MLE}_\varepsilon \circ HW^{-1}. \]

Starting with some constant initial value (or a preliminary estimate obtained, e.g., from historical data), one can compute a sequence of estimates

\[ \hat\alpha_t^\varepsilon(p+1) = \varphi_\varepsilon\big(\hat\alpha_t^\varepsilon(p)\big), \quad p \ge 1. \]

If the operator $\varphi_\varepsilon$ is a strong contraction in a neighborhood of the true value of the parameter $\alpha$ for small enough $\varepsilon$, then one can define the estimate $\hat\alpha_t^\varepsilon$ as the limit of the sequence $\{\hat\alpha_t^\varepsilon(p)\}_{p \ge 1}$. It was proved in RT [32] that $\hat\alpha_t^\varepsilon$ is a strongly consistent estimate of the parameter $\alpha$.

Let us return to our considerations.

Interpolating in some way the realization $\{y_{t_j^\varepsilon}\}$ corresponding to the estimate $\hat\alpha_t^\varepsilon$, we get the reconstructed continuous sample path $(y_s)_{0 \le s \le t}$ of the latent process $Y$, which can be used for further analysis.

Unfortunately, both statistical procedures described are highly sensitive to errors at all steps of the parameter identification process.

Hence, this is a natural place to introduce a robust procedure of parameter estimation.

Suppose that the sample path $(y_s)_{0 \le s \le t}$ comes from the observation of a process $(\tilde Y_s)_{0 \le s \le t}$ with distribution $\tilde P_\alpha^\varepsilon$ from the shrinking contamination neighborhood of the distribution $P_\alpha^\varepsilon$ of the basic process $Y = (Y_s)_{0 \le s \le t}$. That is,

\[ \frac{d\tilde P_\alpha^\varepsilon}{dP_\alpha^\varepsilon} = \mathcal{E}_t(\varepsilon N^\varepsilon), \tag{1.4} \]

where $N^\varepsilon = (N_s^\varepsilon)_{0 \le s \le t}$ is a $P_\alpha^\varepsilon$-square integrable martingale and $\mathcal{E}_t(M)$ is the Doléans exponential of the martingale $M$.

In the diffusion-type framework, (1.4) represents the Huber gross error model (as explained in Remark 2.2). The model of type (1.4) for contamination of measures for statistical models with filtration was suggested by Lazrieva and Toronjadze [20], [21].

In Section 2 we study the problem of constructing robust estimates for the contamination model (1.4).

In subsection 2.1 we give a description of the basic model and the definition of consistent uniformly linear asymptotically normal (CULAN) estimates connected with the basic model (Definition 2.1).
In subsection 2.2 we introduce the notion of a shrinking contamination neighborhood, described in terms of contamination of the nominal distribution, which naturally leads to the class of alternative measures (see (2.18) and (2.19)).

In subsection 2.3 we study the asymptotic behaviour of CULAN estimates under alternative measures (Proposition 2.2), which is the basis for the formulation of the optimization problem.

In subsection 2.4 the optimization problem is solved, which leads to the construction of the optimal B-robust estimate (Theorem 2.1).

Based on the limit theorem (subsection 2.1), one can construct the asymptotic confidence region of level $\gamma$ for the unknown parameter $\alpha$:

\[ \lim_{\varepsilon \to 0} P_\alpha^\varepsilon\Big\{ \varepsilon^{-2}\,(\alpha - \alpha_t^{*,\varepsilon})'\, V^{-1}(\psi^*; \alpha_t^{*,\varepsilon})\,(\alpha - \alpha_t^{*,\varepsilon}) \le \chi_\gamma^2 \Big\} = 1 - \gamma, \]

where $\chi_\gamma^2$ is the quantile of order $1 - \gamma$ of the $\chi^2$-distribution with $m$ degrees of freedom, and $V(\psi^*; \alpha)$ is given by (2.17).

This region shrinks to the estimate $\alpha_t^{*,0}$ as $\varepsilon \to 0$.

Now if the coefficient $a(t, y; \alpha)$ in (1.2) is such that the solution $Y_t^\varepsilon(\alpha)$ of the SDE (1.2) is continuous w.r.t. the parameter $\alpha$ (see, e.g., Krylov [16]), then the confidence region of the parameter $\alpha$ is mapped to the confidence interval for $Y_t^\varepsilon(\alpha)$, which shrinks to $Y_t^* = Y_t^0(\alpha_t^{*,0})$. Further, by the function $f$, the latter interval is mapped to the confidence interval for $\sigma_t$, which shrinks to $\sigma_t^* = f^{1/2}(Y_t^0(\alpha_t^{*,0}))$. Denote by $\sigma_t^0$ the center of this interval. Then the interval can be written in the form

\[ \sigma_t = \sigma_t^0 + \delta(\varepsilon) h_t, \]

where $\delta(\varepsilon) \to 0$ as $\varepsilon \to 0$, and $h \in \mathcal{H}$ (see (3.18)).

Thus, we arrive at the asset price model (1.2) with misspecified volatility, and it is natural to consider the problem of constructing a robust trading strategy to hedge a contingent claim $H$.

We investigate this problem in the mean-variance setting in Section 3. We consider the general situation, when the asset price is modelled by a $d$-dimensional continuous semimartingale and the information structure is given by some general filtration.

In subsection 3.1 we give a description of the financial market model.

In subsection 3.2 we collect the facts concerning the variance-optimal equivalent local martingale measure, which plays a key role in the mean-variance hedging approach.

In the last subsection 3.3 we construct the "optimal robust hedging strategy" (Theorem 3.1) by approximating the optimization problem (3.25) by the problem (3.27). As mentioned in Remark 3.2, this approach and term are common in robust statistics. In contrast to optimal B-robustness (see Section 2), here we develop the approach known in robust statistics as optimal V-robustness; see Hampel et al. [12].

Note that our approach allows incorporating current information on the underlying model, and hence is adaptive. Namely, passing from the time value $t$ to $t + r$, $r > 0$, when more information about market prices is available, the asymptotic variance-covariance matrix of the constructed estimate $\alpha_t^{*,\varepsilon}$ becomes smaller, and hence the estimation procedure becomes more precise.

In the paper of Runggaldier and Zaccaria [35], an adaptive approach to risk management under general uncertainty (restricted information) was developed. As mentioned in that paper, there exists a series of investigations dealing with various types of adaptive approaches (see the list of references in [35]). But in all these papers (except Runggaldier and Zaccaria [35]) the uncertainty is only in the stock appreciation rate, in contrast to our consideration, where the model misspecification is due to the volatility parameter.

The consideration of misspecified asset price models was initiated by Avellaneda et al. [1] and Avellaneda and Parás [2].

Various authors have attacked the robustness problem in different settings. The method used in Section 3 was suggested by Toronjadze [41] for an asset price process modelled by a one-dimensional process. As will be shown in Remark 3.2 below, in the simplest case, when the asset price process is a martingale w.r.t. the initial measure $P$ and it is possible to find the solution of the "exact" optimization problem (3.25), this solution coincides with the solution of the approximating optimization problem (3.27). In the more general situation (when the asset price process is no longer a $P$-martingale), investigation of the problem (3.25) by, e.g., control theory methods seems rather difficult. In any case, we do not know the solution of the problem (3.25).

Let us return to the stochastic volatility model (1.2) and describe the successive steps of our approach:

1) For each current time value $t$, $0 \le t \le T$, reconstruct the sample path $(y_s)_{0 \le s \le t}$, using the historical data or the prices of tradable derivatives;

2) Using the approach developed in Section 2, calculate the value $\alpha_t^{*,\varepsilon}$ of the robust estimate of the parameter $\alpha$ (i.e. construct the deterministic function $t \to \alpha_t^{*,\varepsilon} \in R^m$) and then find the confidence region for the parameter $\alpha$;

3) Based on the volatility process model, find the confidence interval for the volatility $\sigma_t$;

4) Denoting $a^*(t, y) = a(t, y; \alpha_t^{*,\varepsilon})$, where $a(t, y; \alpha)$ is the drift coefficient of the volatility process, consider the stochastic volatility model with misspecified asset price model and fully specified volatility process model

\[
\begin{aligned}
dX_t &= X_t\, dR_t, \quad X_0 > 0,\\
dR_t &= (\sigma_t^0 + \delta(\varepsilon) h_t)\, dM_t^0, \quad R_0 = 0,\\
dY_t &= a^*(t, Y_t)\, dt + \varepsilon\, dw_t^\sigma, \quad Y_0 = 0, \quad 0 \le t \le T,
\end{aligned}
\]

where

\[ dM_t^0 = k_t\, dt + dw_t^R, \]

$h \in \mathcal{H}$ and $\sigma_t^0$ is the center of the confidence interval of the volatility.

Using Theorem 3.1, construct the optimal robust hedging strategy $\theta^*$ by the formula (3.44), where all objects are defined in Theorem 3.1. □

It should be mentioned that if one constructs a hedging strategy $\tilde\theta^*$ by the same formula with $\sigma_t^{*,\varepsilon} = f^{1/2}(Y_t^0(\alpha_t^{*,\varepsilon}))$ instead of $\sigma_t^0$, then the strategies $\theta^*$ and $\tilde\theta^*$ would be different, since $\sigma_t^{*,\varepsilon} \ne \sigma_t^0$ in general. Hence the value $\Delta_t = |\sigma_t^{*,\varepsilon} - \sigma_t^0|$ defines the correction term between the robust strategy $\theta^*$ and the non-robust strategy $\tilde\theta^*$.

In the nontrivial case, when $k_t = k(Y_t)$, the variance-optimal martingale measure $\tilde P$ is given by (3.17), $\zeta_t = -k_t\,\mathcal{E}_t(-k \cdot M^0)$ (see subsection 3.2), and the process $(X_t, Y_t)_{0 \le t \le T}$ is a Markov process. If $H = h(X_T, Y_T)$ ($h(x, y)$ is some function), then $V_t^H = E^{\tilde P}(H \mid \mathcal{F}_t^w) = E^{\tilde P}(h(X_T, Y_T) \mid \mathcal{F}_t^w) = v(t, X_t, Y_t)$ and if, e.g., $v(t, x, y) \in C^{1,2,2}$, then $v$ is the unique solution of the following partial differential equation

dv „ dv 1 / 2 d 2 v 2 2 d 2 v ^ 
dt dy 2 V dy 2 dx 2 



with the boundary condition $v(T, x, y) = h(x, y)$. The more general situation with nonsmooth $v$ is considered in Laurent and Pham [18], Mania and Tevzadze.

Further, one can find the Galtchouk-Kunita-Watanabe decomposition of the r.v. $H$ (see, e.g., Pham et al. [31]) putting



and calculate ip]r , Lt and V t * using (4.13) and (4.14) of RSch [33] . 

Thus one gets the explicit solution of the mean-variance hedging problem.

Finally, here is a short summary of the approach:

a) Incorporate the robust procedure in the statistical analysis of the volatility process. That is, construct and use in the model the optimal B-robust estimate of the unknown parameter in the drift coefficient of the volatility process.

Parameter estimation naturally leads to asset price model misspecification.

b) Incorporate the second robust procedure in the financial analysis of contingent claim hedging. That is, construct and use for hedging purposes the optimal V-robust trading strategy.

In our opinion this "double robust" strategy should be more attractive for protecting the hedger against possible errors.

The general asymptotic theory of estimation can be found in Ibragimov and Khas'minskii [13]; the theory of robust statistics is developed in Hampel et al. [12] and in Rieder [34]; the theory of trend parameter estimates for diffusion processes with small noise is developed in Kutoyants [17]; the book of Musiela and Rutkowski [29] is devoted to the mathematical theory of finance; and finally, the general theory of martingales can be found in Jacod and Shiryaev [15].

2. Optimal B-Robust Estimates

2.1. Basic model. CULAN estimates. The basic model of observations is described by the SDE

\[ dY_s = a(s, Y; \alpha)\, ds + \varepsilon\, dw_s, \quad Y_0 = 0, \quad 0 \le s \le t, \tag{2.1} \]

where $t$ is a fixed number, $w = (w_s)_{0 \le s \le t}$ is a standard Wiener process defined on the filtered probability space $(\Omega, \mathcal{F}, F = (\mathcal{F}_s)_{0 \le s \le t}, P)$ satisfying the usual conditions, $\alpha = (\alpha_1, \dots, \alpha_m)$, $m \ge 1$, is an unknown parameter to be estimated, $\alpha \in A \subset R^m$, $A$ is an open subset of $R^m$, and $\varepsilon$, $0 < \varepsilon \ll 1$, is a small parameter (index of series). In our further considerations all limits correspond to $\varepsilon \to 0$.

Denote by $(C_t, \mathcal{B}_t)$ the measurable space of continuous functions $x = (x_s)_{0 \le s \le t}$ on $[0, t]$ with $\sigma$-algebra $\mathcal{B}_t = \sigma(x : x_s, s \le t)$. Put $\mathcal{B}_s = \sigma(x : x_u, u \le s)$.

Assume that for each $\alpha \in A$ the drift coefficient $a(s, x; \alpha)$, $0 \le s \le t$, $x \in C_t$, is a known nonanticipative (i.e. $\mathcal{B}_s$-measurable for each $s$, $0 \le s \le t$) functional satisfying the functional Lipschitz and linear growth conditions L:

\[ |a(s, x^1; \alpha) - a(s, x^2; \alpha)| \le L_1 \int_0^s |x_u^1 - x_u^2|\, dk_u + L_2 |x_s^1 - x_s^2|, \]

\[ |a(s, x; \alpha)| \le L_1 \int_0^s (1 + |x_u|)\, dk_u + L_2 (1 + |x_s|), \]

where $L_1$ and $L_2$ are constants which do not depend on $\alpha$, $k = (k(s))_{0 \le s \le t}$ is a non-decreasing right-continuous function, $0 \le k(s) \le k_0$, $k_0 < \infty$, and $x^1, x^2 \in C_t$.

Then, as is well known (see, e.g., Liptser and Shiryaev [23]), for each $\alpha \in A$ the equation (2.1) has a unique strong solution $Y^\varepsilon(\alpha) = (Y_s^\varepsilon(\alpha))_{0 \le s \le t}$, and in addition (see Kutoyants [17])

\[ \sup_{0 \le s \le t} |Y_s^\varepsilon(\alpha) - Y_s^0(\alpha)| \le C \varepsilon \sup_{0 \le s \le t} |w_s| \quad P\text{-a.s.}, \]

with some constant $C = C(L_1, L_2, k_0, t)$, where $Y^0(\alpha) = (Y_s^0(\alpha))_{0 \le s \le t}$ is the solution of the following nonperturbed differential equation:

\[ dY_s = a(s, Y; \alpha)\, ds, \quad Y_0 = 0. \tag{2.2} \]
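The small-noise bound can be explored numerically. The sketch below runs Euler schemes for the perturbed and unperturbed equations on the same grid and returns $\sup_s |Y_s^\varepsilon - Y_s^0|$ together with $\varepsilon \sup_s |w_s|$. The Markovian drift $a(s, y; \alpha) = \alpha_1 - \alpha_2 y$ is a hypothetical special case chosen for illustration, and the function name is ours.

```python
import math
import random


def small_noise_gap(alpha, eps, t=1.0, n=1000, seed=0):
    """Return (sup |Y^eps - Y^0|, eps * sup |w|) on an Euler grid for the
    assumed drift a(s, y; alpha) = alpha[0] - alpha[1] * y."""
    rng = random.Random(seed)
    dt = t / n
    y_eps = y0 = w = 0.0
    gap = sup_w = 0.0
    for _ in range(n):
        dw = rng.gauss(0.0, math.sqrt(dt))
        y_eps += (alpha[0] - alpha[1] * y_eps) * dt + eps * dw   # perturbed SDE
        y0 += (alpha[0] - alpha[1] * y0) * dt                    # unperturbed ODE
        w += dw
        gap = max(gap, abs(y_eps - y0))
        sup_w = max(sup_w, abs(w))
    return gap, eps * sup_w
```

For this linear drift the gap scales linearly in $\varepsilon$ for a fixed noise path, and on the Euler grid (with $0 < \alpha_2\,dt < 1$) it does not exceed twice $\varepsilon \sup |w|$; this constant is specific to the example and is not the constant $C$ of the general statement.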

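Replace the initial problem of estimating the parameter $\alpha$ by an equivalent one, in which the observations are modelled according to the following SDE:

\[ dX_s = a_\varepsilon(s, X; \alpha)\, ds + dw_s, \quad X_0 = 0, \tag{2.3} \]

where $a_\varepsilon(s, x; \alpha) = \frac{1}{\varepsilon} a(s, \varepsilon x; \alpha)$, $0 \le s \le t$, $x \in C_t$, $\alpha \in A$.

It is clear that if $X^\varepsilon(\alpha) = (X_s^\varepsilon(\alpha))_{0 \le s \le t}$ is the solution of SDE (2.3), then for each $s \in [0, t]$, $\varepsilon X_s^\varepsilon(\alpha) = Y_s^\varepsilon(\alpha)$.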

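Denote by $P_\alpha^\varepsilon$ the distribution of the process $X^\varepsilon(\alpha)$ on the space $(C_t, \mathcal{B}_t)$, i.e. $P_\alpha^\varepsilon$ is the probability measure on $(C_t, \mathcal{B}_t)$ induced by the process $X^\varepsilon(\alpha)$. Let $P^w$ be the Wiener measure on $(C_t, \mathcal{B}_t)$. Denote by $X = (X_s)_{0 \le s \le t}$ the coordinate process on $(C_t, \mathcal{B}_t)$, that is $X_s(x) = x_s$, $x \in C_t$.

The conditions L guarantee that for each $\alpha \in A$ the measures $P_\alpha^\varepsilon$ and $P^w$ are equivalent ($P_\alpha^\varepsilon \sim P^w$), and if we denote by $z_s^{\alpha,\varepsilon} = \frac{dP_\alpha^\varepsilon}{dP^w}\big|_{\mathcal{B}_s}$ the density process (likelihood ratio process), then

\[ z_s^{\alpha,\varepsilon}(X) = \mathcal{E}_s(a_\varepsilon(\alpha) \cdot X) := \exp\Big\{ \int_0^s a_\varepsilon(u, X; \alpha)\, dX_u - \frac{1}{2} \int_0^s a_\varepsilon^2(u, X; \alpha)\, du \Big\}. \]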

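Introduce the class $\Psi$ of $R^m$-valued nonanticipative functionals $\psi$, $\psi : [0, t] \times C_t \times A \to R^m$, such that for each $\alpha \in A$ and $\varepsilon > 0$

\[ 1)\quad \int_0^t |\psi(s, X; \alpha)|^2\, ds < \infty, \tag{2.4} \]

\[ 2)\quad \int_0^t |\psi(s, Y^0(\alpha); \alpha)|^2\, ds < \infty, \tag{2.5} \]

3) uniformly in $\alpha$ on each compact $K \subset A$

\[ P_\alpha^\varepsilon\text{-}\lim_{\varepsilon \to 0} \int_0^t |\psi(s, \varepsilon X; \alpha) - \psi(s, Y^0(\alpha); \alpha)|^2\, ds = 0, \tag{2.6} \]

where $|\cdot|$ is the Euclidean norm in $R^m$ and $P_\alpha^\varepsilon\text{-}\lim_{\varepsilon \to 0} \zeta_\varepsilon = \zeta$ denotes the convergence $P_\alpha^\varepsilon\{|\zeta_\varepsilon - \zeta| > \rho\} \to 0$ as $\varepsilon \to 0$, for all $\rho$, $\rho > 0$.

Assume that for each $s \in [0, t]$ and $x \in C_t$ the functional $a(s, x; \alpha)$ is differentiable in $\alpha$ and the gradient $\dot a = \big(\frac{\partial}{\partial \alpha_1} a, \dots, \frac{\partial}{\partial \alpha_m} a\big)'$ belongs to $\Psi$ ($\dot a \in \Psi$), where the sign "$'$" denotes transposition.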

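Then the Fisher information process

\[ I_s^\varepsilon(X; \alpha) := \int_0^s \dot a_\varepsilon(u, X; \alpha)[\dot a_\varepsilon(u, X; \alpha)]'\, du, \quad 0 \le s \le t, \]

is well defined and, moreover, uniformly in $\alpha$ on each compact

\[ P_\alpha^\varepsilon\text{-}\lim_{\varepsilon \to 0} \varepsilon^2 I_t^\varepsilon(\alpha) = I_0(\alpha), \tag{2.7} \]

where

\[ I_0(\alpha) := \int_0^t \dot a(s, Y^0(\alpha); \alpha)[\dot a(s, Y^0(\alpha); \alpha)]'\, ds. \]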
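For each $\psi \in \Psi$ introduce the functional $\psi_\varepsilon(s, x; \alpha) := \frac{1}{\varepsilon} \psi(s, \varepsilon x; \alpha)$ and the matrices $\Gamma_t^{\psi,\varepsilon}(\alpha)$ and $\gamma_t^{\psi,\varepsilon}(\alpha)$:

\[ \Gamma_t^{\psi,\varepsilon}(X; \alpha) := \int_0^t \psi_\varepsilon(s, X; \alpha)[\psi_\varepsilon(s, X; \alpha)]'\, ds, \tag{2.8} \]

\[ \gamma_t^{\psi,\varepsilon}(X; \alpha) := \int_0^t \psi_\varepsilon(s, X; \alpha)[\dot a_\varepsilon(s, X; \alpha)]'\, ds. \tag{2.9} \]

Then from (2.6) it follows that uniformly in $\alpha$ on each compact

\[ P_\alpha^\varepsilon\text{-}\lim_{\varepsilon \to 0} \varepsilon^2 \Gamma_t^{\psi,\varepsilon}(\alpha) = \Gamma_0^\psi(\alpha), \tag{2.10} \]

\[ P_\alpha^\varepsilon\text{-}\lim_{\varepsilon \to 0} \varepsilon^2 \gamma_t^{\psi,\varepsilon}(\alpha) = \gamma_0^\psi(\alpha), \tag{2.11} \]

where the matrices $\Gamma_0^\psi(\alpha)$ and $\gamma_0^\psi(\alpha)$ are defined as follows:

\[ \Gamma_0^\psi(\alpha) := \int_0^t \psi(s, Y^0(\alpha); \alpha)[\psi(s, Y^0(\alpha); \alpha)]'\, ds, \tag{2.12} \]

\[ \gamma_0^\psi(\alpha) := \int_0^t \psi(s, Y^0(\alpha); \alpha)[\dot a(s, Y^0(\alpha); \alpha)]'\, ds. \tag{2.13} \]

Note that, by virtue of (2.4), (2.5) and $\dot a \in \Psi$, the matrices given by (2.8), (2.9), (2.12) and (2.13) are well defined.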

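Denote by $\Psi_0$ the subset of $\Psi$ such that for each $\psi \in \Psi_0$ and $\alpha \in A$, $\operatorname{rank} \Gamma_0^\psi(\alpha) = m$ and $\operatorname{rank} \gamma_0^\psi(\alpha) = m$.

Assume that $\dot a \in \Psi_0$.

For each $\psi \in \Psi_0$, define a $P_\alpha^\varepsilon$-square integrable martingale $L^{\psi,\varepsilon}(\alpha) = (L_s^{\psi,\varepsilon}(\alpha))_{0 \le s \le t}$ as follows:

\[ L_s^{\psi,\varepsilon}(X; \alpha) = \int_0^s \psi_\varepsilon(u, X; \alpha)(dX_u - a_\varepsilon(u, X; \alpha)\, du). \tag{2.14} \]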


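Now we give a definition of CULAN M-estimates.

Definition 2.1. An estimate $(\alpha_t^{\psi,\varepsilon})_{\varepsilon > 0} = (\alpha_{1,t}^{\psi,\varepsilon}, \dots, \alpha_{m,t}^{\psi,\varepsilon})'_{\varepsilon > 0}$, $\psi \in \Psi_0$, is called consistent uniformly linear asymptotically normal (CULAN) if it admits the following expansion:

\[ \alpha_t^{\psi,\varepsilon} = \alpha + [\gamma_0^\psi(\alpha)]^{-1} \varepsilon^2 L_t^{\psi,\varepsilon}(\alpha) + r_{\psi,\varepsilon}(\alpha), \tag{2.15} \]

where uniformly in $\alpha$ on each compact

\[ P_\alpha^\varepsilon\text{-}\lim_{\varepsilon \to 0} \varepsilon^{-1} r_{\psi,\varepsilon}(\alpha) = 0. \tag{2.16} \]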
It is well known (see Lazrieva and Toronjadze [19]) that under the above conditions, uniformly in $\alpha$ on each compact,

\[ \mathcal{L}\{\varepsilon^{-1}(\alpha_t^{\psi,\varepsilon} - \alpha) \mid P_\alpha^\varepsilon\} \Rightarrow N(0, V(\psi; \alpha)), \]

with

\[ V(\psi; \alpha) := [\gamma_0^\psi(\alpha)]^{-1}\, \Gamma_0^\psi(\alpha)\, ([\gamma_0^\psi(\alpha)]^{-1})', \tag{2.17} \]

where $\mathcal{L}(\zeta \mid P)$ denotes the distribution of the random vector $\zeta$ calculated under the measure $P$, the symbol "$\Rightarrow$" denotes the weak convergence of measures, and $N(0, V(\psi; \alpha))$ is the distribution of a Gaussian vector with zero mean and covariance matrix $V(\psi; \alpha)$.

Remark 2.1. In the context of diffusion-type processes the M-estimate $(\alpha_t^{\psi,\varepsilon})_{\varepsilon > 0}$ is defined as a solution of the following stochastic equation:

\[ L_t^{\psi,\varepsilon}(X; \alpha) = 0, \]

where $L_t^{\psi,\varepsilon}(X; \alpha)$ is defined by (2.14), $\psi \in \Psi$.

The asymptotic theory of M-estimates for general statistical models with filtration is developed in Chitashvili et al. [7]. Namely, the problem of existence and global behaviour of solutions is studied. In particular, conditions of regularity and ergodicity type are established under which M-estimates have the CULAN property.

For our model, in the case when $A = R^m$, the sufficient conditions for the CULAN property take the form:

(1) for all $s$, $0 \le s \le t$, and $x \in C_t$ the functionals $\psi(s, x; \alpha)$ and $a(s, x; \alpha)$ are twice continuously differentiable in $\alpha$ with bounded derivatives satisfying the functional Lipschitz conditions with constants which do not depend on $\alpha$;

(2) the equation (w.r.t. $y$)

\[ \Delta(\alpha, y) := \int_0^t \psi(s, Y^0(\alpha); y)\big(a(s, Y^0(\alpha); \alpha) - a(s, Y^0(\alpha); y)\big)\, ds = 0 \]

has a unique solution $y = \alpha$.

The MLE is a special case of M-estimates when $\psi = \dot a$.

Remark 2.2. According to (2.7), the asymptotic covariance matrix of the MLE $(\hat\alpha_t^\varepsilon)_{\varepsilon > 0}$ is $[I_0(\alpha)]^{-1}$. By the usual technique one can show that for each $\alpha \in A$ and $\psi \in \Psi_0$, $[I_0(\alpha)]^{-1} \le V(\psi; \alpha)$ (see (2.17)), where for two symmetric matrices $B$ and $C$ the relation $B \le C$ means that the matrix $C - B$ is nonnegative definite.

Thus, the MLE has the minimal covariance matrix among all M-estimates.
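The comparison between the covariance of the MLE and that of a general M-estimate can be checked numerically in the scalar case $m = 1$, where it reduces to the Cauchy-Schwarz inequality. The sketch below is an illustration under our own assumptions: the drift $a(s, y; \alpha) = 1 - \alpha y$ (so that $\dot a = -y$ and $Y_s^0 = (1 - e^{-\alpha s})/\alpha$) and a score function $\psi$ obtained by clipping $\dot a$ to $[-c, c]$, a Huber-type choice that is not claimed to be the optimal one.

```python
import math


def asymptotic_variances(alpha, c, t=1.0, n=2000):
    """Scalar sketch: returns (1 / I_0(alpha), V(psi; alpha)) for the assumed
    drift a(s, y; alpha) = 1 - alpha * y and psi = clip(a_dot, -c, c)."""
    ds = t / n
    fisher = gamma_cap = gamma_small = 0.0
    for i in range(n):
        s = (i + 0.5) * ds                            # midpoint rule
        y0 = (1.0 - math.exp(-alpha * s)) / alpha     # unperturbed solution Y^0
        a_dot = -y0                                   # derivative of a in alpha
        psi = max(-c, min(c, a_dot))                  # clipped score function
        fisher += a_dot * a_dot * ds                  # Fisher information I_0
        gamma_cap += psi * psi * ds                   # Gamma_0^psi, cf. (2.12)
        gamma_small += psi * a_dot * ds               # gamma_0^psi, cf. (2.13)
    v_psi = gamma_cap / gamma_small ** 2              # V(psi; alpha), cf. (2.17)
    return 1.0 / fisher, v_psi
```

With a small clipping level the second value exceeds the first, which is the efficiency price of bounding the influence; with a clipping level larger than $\sup |\dot a|$ the two values coincide because $\psi = \dot a$.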

2.2. Shrinking contamination neighborhoods. In this subsection we introduce the notion of contamination of the basic model (2.3), described in terms of shrinking neighborhoods of the basic measures $\{P_\alpha^\varepsilon, \alpha \in A, \varepsilon > 0\}$, which is an analog of the Huber gross error model (see, e.g., Hampel et al. [12] and also Remark 2.3 below).
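Let $\mathcal{H}$ be a family of bounded nonanticipative functionals $h : [0, t] \times C_t \times A \to R^1$ such that for all $s \in [0, t]$ and $\alpha \in A$ the functional $h(s, x; \alpha)$ is continuous at the point $x = Y^0(\alpha)$.

For each $h \in \mathcal{H}$, $\alpha \in A$ and $\varepsilon > 0$, let $P_\alpha^{\varepsilon,h}$ be a measure on $(C_t, \mathcal{B}_t)$ such that

\[ 1)\quad P_\alpha^{\varepsilon,h} \sim P_\alpha^\varepsilon, \]

\[ 2)\quad \frac{dP_\alpha^{\varepsilon,h}}{dP_\alpha^\varepsilon} = \mathcal{E}_t(\varepsilon N_\alpha^{\varepsilon,h}), \tag{2.18} \]

where

\[ 3)\quad N_{\alpha,s}^{\varepsilon,h} := \int_0^s h_\varepsilon(u, X; \alpha)(dX_u - a_\varepsilon(u, X; \alpha)\, du), \tag{2.19} \]

with $h_\varepsilon(s, x; \alpha) := \frac{1}{\varepsilon} h(s, \varepsilon x; \alpha)$, $0 \le s \le t$, $x \in C_t$.

Denote by $\mathcal{P}_\alpha^{\varepsilon,\mathcal{H}}$ the class of measures $P_\alpha^{\varepsilon,h}$, $h \in \mathcal{H}$, that is

\[ \mathcal{P}_\alpha^{\varepsilon,\mathcal{H}} = \{P_\alpha^{\varepsilon,h}, h \in \mathcal{H}\}. \]

We call $(\mathcal{P}_\alpha^{\varepsilon,\mathcal{H}})_{\varepsilon > 0}$ the shrinking contamination neighborhoods of the basic measures $(P_\alpha^\varepsilon)_{\varepsilon > 0}$, and an element $(P_\alpha^{\varepsilon,h})_{\varepsilon > 0}$ of these neighborhoods is called an alternative measure (or simply an alternative).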

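Obviously, for each $h \in \mathcal{H}$ and $\alpha \in A$, the process $N_\alpha^{\varepsilon,h} = (N_{\alpha,s}^{\varepsilon,h})_{0 \le s \le t}$ defined by (2.19) is a $P_\alpha^\varepsilon$-square integrable martingale. Since under the measure $P_\alpha^\varepsilon$ the process $w = (w_s)_{0 \le s \le t}$ defined as

\[ w_s := X_s - \int_0^s a_\varepsilon(u, X; \alpha)\, du, \quad 0 \le s \le t, \]

is a Wiener process, by virtue of the Girsanov theorem the process $\tilde w := w - \langle w, \varepsilon N_\alpha^{\varepsilon,h} \rangle$ is a Wiener process under the changed measure $P_\alpha^{\varepsilon,h}$. But by definition

\[ \tilde w_s = X_s - \int_0^s \big(a_\varepsilon(u, X; \alpha) + \varepsilon h_\varepsilon(u, X; \alpha)\big)\, du, \]

and hence one can conclude that $P_\alpha^{\varepsilon,h}$ is a weak solution of the SDE

\[ dX_s = \big(a_\varepsilon(s, X; \alpha) + \varepsilon h_\varepsilon(s, X; \alpha)\big)\, ds + dw_s, \quad X_0 = 0. \]

This SDE can be viewed as a "small" perturbation of the basic model (2.3).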

Remark 2.3. 1) In the case of i.i.d. observations $X_1, X_2, \dots, X_n$, $n \ge 1$, the Huber gross error model in the shrinking setting is defined as follows:

\[ f^{n,h}(x; \alpha) := (1 - \varepsilon_n) f(x; \alpha) + \varepsilon_n h(x; \alpha), \]

where $f(x; \alpha)$ is the basic (core) density of the distribution of the r.v. $X_i$ (w.r.t. some dominating measure $\mu$), $h(x; \alpha)$ is a contaminating density, $f^{n,h}(x; \alpha)$ is the contaminated density, and $\varepsilon_n = O(n^{-1/2})$. If we denote by $P_\alpha^n$ and $P_\alpha^{n,h}$ the measures on $(R^n, \mathcal{B}(R^n))$ generated by $f(x; \alpha)$ and $f^{n,h}(x; \alpha)$, respectively, then

\[ \frac{dP_\alpha^{n,h}}{dP_\alpha^n} = \prod_{i=1}^n \frac{f^{n,h}(X_i; \alpha)}{f(X_i; \alpha)} = \prod_{i=1}^n \big(1 + \varepsilon_n H(X_i; \alpha)\big) = \mathcal{E}_n(\varepsilon_n \cdot N_\alpha^{n,h}), \]

where $H = \frac{h - f}{f}$, $N_\alpha^{n,h} = (N_{\alpha,m}^{n,h})_{1 \le m \le n}$, $N_{\alpha,m}^{n,h} = \sum_{i=1}^m H(X_i; \alpha)$, $N_\alpha^{n,h}$ is a $P_\alpha^n$-martingale, and $\mathcal{E}_n(\varepsilon_n N_\alpha^{n,h}) = \prod_{i=1}^n (1 + \varepsilon_n \Delta N_{\alpha,i}^{n,h})$ is the Doléans exponential in the discrete time case.

Thus

\[ \frac{dP_\alpha^{n,h}}{dP_\alpha^n} = \mathcal{E}_n(\varepsilon_n \cdot N_\alpha^{n,h}), \tag{2.20} \]

and the relation (2.18) is a direct analog of (2.20).

2) The concept of shrinking contamination neighborhoods expressed in the form (2.18) was proposed in Lazrieva and Toronjadze [20] for a more general situation concerning contamination areas for semimartingale statistical models with filtration. □
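The identity between the likelihood ratio of the gross error model and the discrete-time Doléans exponential in part 1) of the remark is elementary algebra and can be verified numerically. In the sketch below the core density $f$ is standard normal and the contaminating density $h$ is normal with standard deviation 3; both choices, and all function names, are illustrative assumptions.

```python
import math


def normal_pdf(x, sd=1.0):
    """Density of the centered normal distribution with standard deviation sd."""
    return math.exp(-0.5 * (x / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))


def likelihood_ratio(xs, f, h, eps_n):
    """Product over the sample of contaminated-to-core density ratios."""
    out = 1.0
    for x in xs:
        out *= ((1.0 - eps_n) * f(x) + eps_n * h(x)) / f(x)
    return out


def doleans_exponential(xs, f, h, eps_n):
    """Discrete-time Doleans exponential prod(1 + eps_n * dN_i) with
    martingale increments dN_i = (h(X_i) - f(X_i)) / f(X_i)."""
    out = 1.0
    for x in xs:
        out *= 1.0 + eps_n * (h(x) - f(x)) / f(x)
    return out
```

For any sample the two functions return the same value up to rounding, e.g. with `f = normal_pdf` and `h = lambda x: normal_pdf(x, 3.0)`.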

Note here that the power of the small parameter $\varepsilon$ is crucial. One cannot consider a perturbation of the measure with a different power of $\varepsilon$ if one wishes to obtain a nontrivial result.

In the remainder of this subsection we study the asymptotic properties of CULAN estimates under alternatives.

For this purpose we first consider the problem of contiguity of the measures $(P_\alpha^{\varepsilon,h})_{\varepsilon > 0}$ to $(P_\alpha^\varepsilon)_{\varepsilon > 0}$.

Let $(\varepsilon_n)_{n \ge 1}$, $\varepsilon_n \downarrow 0$, and $(\alpha_n)_{n \ge 1}$, $\alpha_n \in K$, where $K \subset A$ is a compact, be arbitrary sequences.

Proposition 2.1. For each $h \in \mathcal{H}$ the sequence of measures $(P_{\alpha_n}^{\varepsilon_n,h})$ is contiguous to the sequence of measures $(P_{\alpha_n}^{\varepsilon_n})$, i.e.

\[ (P_{\alpha_n}^{\varepsilon_n,h}) \triangleleft (P_{\alpha_n}^{\varepsilon_n}). \]

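Proof. From the predictable criteria of contiguity (see, e.g., Jacod and Shiryaev [15]) it follows that we have to verify the relation

\[ \lim_{N \to \infty} \limsup_{n \to \infty} P_{\alpha_n}^{\varepsilon_n,h}\Big\{ h_t^n\big(\tfrac12\big) > N \Big\} = 0, \tag{2.21} \]

where $h^n(\frac12) = (h_s^n(\frac12))_{0 \le s \le t}$ is the Hellinger process of order $\frac12$.

By the definition of the Hellinger process (see, e.g., Jacod and Shiryaev [15]) we have

\[ h_t^n\big(\tfrac12\big) = h_t\big(\tfrac12; P_{\alpha_n}^{\varepsilon_n,h}, P_{\alpha_n}^{\varepsilon_n}\big) = \frac18 \int_0^t [h(s, \varepsilon_n X; \alpha_n)]^2\, ds, \]

and since $h \in \mathcal{H}$, and hence is bounded, $h_t^n(\frac12)$ is bounded too, which provides (2.21). □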
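Proposition 2.2. For each estimate $(\alpha_t^{\psi,\varepsilon})_{\varepsilon > 0}$ with $\psi \in \Psi_0$ and each alternative $(P_\alpha^{\varepsilon,h})_{\varepsilon > 0} \in (\mathcal{P}_\alpha^{\varepsilon,\mathcal{H}})_{\varepsilon > 0}$ the following relation holds true:

\[ \mathcal{L}\{\varepsilon^{-1}(\alpha_t^{\psi,\varepsilon} - \alpha) \mid P_\alpha^{\varepsilon,h}\} \Rightarrow N\big([\gamma_0^\psi(\alpha)]^{-1} b(\psi, h; \alpha), V(\psi; \alpha)\big), \]

where

\[ b(\psi, h; \alpha) := \int_0^t \psi(s, Y^0(\alpha); \alpha)\, h(s, Y^0(\alpha); \alpha)\, ds. \]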

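Proof. Proposition 2.1 together with (2.16) implies that uniformly in $\alpha$ on each compact

\[ P_\alpha^{\varepsilon,h}\text{-}\lim_{\varepsilon \to 0} \varepsilon^{-1} r_{\psi,\varepsilon}(\alpha) = 0, \]

and therefore we have to establish the limit distribution of the random vector $[\gamma_0^\psi(\alpha)]^{-1} \varepsilon L_t^{\psi,\varepsilon}$ under the measures $(P_\alpha^{\varepsilon,h})_{\varepsilon > 0}$.

By virtue of the Girsanov theorem, the process $L^{\psi,\varepsilon}(\alpha) = (L_s^{\psi,\varepsilon}(\alpha))_{0 \le s \le t}$ is a semimartingale with canonical decomposition

\[ L_s^{\psi,\varepsilon}(\alpha) = \tilde L_s^{\psi,\varepsilon}(\alpha) + b_{\varepsilon,s}(\psi, h; \alpha), \quad 0 \le s \le t, \tag{2.22} \]

where $\tilde L^{\psi,\varepsilon}(\alpha) = (\tilde L_s^{\psi,\varepsilon}(\alpha))_{0 \le s \le t}$ is a $P_\alpha^{\varepsilon,h}$-square integrable martingale defined as follows:

\[ \tilde L_s^{\psi,\varepsilon}(X; \alpha) := \int_0^s \psi_\varepsilon(u, X; \alpha)\big(dX_u - (a_\varepsilon(u, X; \alpha) + \varepsilon h_\varepsilon(u, X; \alpha))\, du\big), \]

and

\[ b_{\varepsilon,s}(\psi, h; \alpha) := \varepsilon \int_0^s \psi_\varepsilon(u, X; \alpha)\, h_\varepsilon(u, X; \alpha)\, du. \]

But $\langle \tilde L^{\psi,\varepsilon}(\alpha) \rangle_t = \Gamma_t^{\psi,\varepsilon}(\alpha)$, where $\Gamma_t^{\psi,\varepsilon}(\alpha)$ is defined by (2.8). On the other hand, from Proposition 2.1 and (2.10) it follows that

\[ P_\alpha^{\varepsilon,h}\text{-}\lim_{\varepsilon \to 0} \langle \varepsilon \tilde L^{\psi,\varepsilon}(\alpha) \rangle_t = P_\alpha^{\varepsilon,h}\text{-}\lim_{\varepsilon \to 0} \varepsilon^2 \Gamma_t^{\psi,\varepsilon}(\alpha) = P_\alpha^\varepsilon\text{-}\lim_{\varepsilon \to 0} \varepsilon^2 \Gamma_t^{\psi,\varepsilon}(\alpha) = \Gamma_0^\psi(\alpha) \]

uniformly in $\alpha$ on each compact, and hence

\[ \mathcal{L}\{[\gamma_0^\psi(\alpha)]^{-1} \varepsilon \tilde L_t^{\psi,\varepsilon}(\alpha) \mid P_\alpha^{\varepsilon,h}\} \Rightarrow N(0, V(\psi; \alpha)). \tag{2.23} \]

Finally, relation (2.23) together with (2.22) and the relation

\[ P_\alpha^{\varepsilon,h}\text{-}\lim_{\varepsilon \to 0} \varepsilon\, b_{\varepsilon,t}(\psi, h; \alpha) = \int_0^t \psi(s, Y^0(\alpha); \alpha)\, h(s, Y^0(\alpha); \alpha)\, ds = b(\psi, h; \alpha) \]

provides the desired result. □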
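2.3. Optimization criteria. Construction of optimal B-robust estimates. In this subsection we state and solve an optimization problem, which results in the construction of the optimal B-robust estimate.

Initially, it should be stressed that the bias vector $\tilde b(\psi, h; \alpha) := [\gamma_0^\psi(\alpha)]^{-1} b(\psi, h; \alpha)$ can be viewed as the influence functional of the estimate $(\alpha_t^{\psi,\varepsilon})_{\varepsilon > 0}$ w.r.t. the alternative $(P_\alpha^{\varepsilon,h})_{\varepsilon > 0}$.

Indeed, the expansion (2.15) together with (2.22) and (2.23) allows us to conclude that

\[ \mathcal{L}\{\varepsilon^{-1}(\alpha_t^{\psi,\varepsilon} - \alpha - \varepsilon^2 [\gamma_0^\psi(\alpha)]^{-1} b_{\varepsilon,t}(\psi, h; \alpha)) \mid P_\alpha^{\varepsilon,h}\} \Rightarrow N(0, V(\psi; \alpha)), \]

and hence the expression

\[ \alpha + \varepsilon^2 [\gamma_0^\psi(\alpha)]^{-1} b_{\varepsilon,t}(\psi, h; \alpha) - \alpha = \varepsilon^2 [\gamma_0^\psi(\alpha)]^{-1} b_{\varepsilon,t}(\psi, h; \alpha) \]

plays the role of the bias at the "fixed step $\varepsilon$", and it seems natural to interpret the limit

\[ P_\alpha^{\varepsilon,h}\text{-}\lim_{\varepsilon \to 0} \varepsilon\, [\gamma_0^\psi(\alpha)]^{-1} b_{\varepsilon,t}(\psi, h; \alpha) = [\gamma_0^\psi(\alpha)]^{-1} b(\psi, h; \alpha) = \tilde b(\psi, h; \alpha) \]

as the influence functional.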

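For each estimate $(\alpha_t^{\psi,\varepsilon})_{\varepsilon>0}$, $\psi\in\Psi_0$, define the risk functional w.r.t. the alternative $(P_\alpha^{\varepsilon,h})_{\varepsilon>0}$, $h\in\mathcal H$, as follows:

$$D(\psi,h;\alpha)=\lim_{a\to\infty}\lim_{\varepsilon\to0}E_\alpha^{\varepsilon,h}\big((\varepsilon^{-2}|\alpha_t^{\psi,\varepsilon}-\alpha|^2)\wedge a\big),$$

where $x\wedge a=\min(x,a)$, $a>0$, and $E_\alpha^{\varepsilon,h}$ is the expectation w.r.t. the measure $P_\alpha^{\varepsilon,h}$.
Using Proposition 2.2 it is not hard to verify that

$$D(\psi,h;\alpha)=|\tilde b(\psi,h;\alpha)|^2+\operatorname{tr}V(\psi;\alpha),$$

where $\operatorname{tr}A$ denotes the trace of the matrix $A$.
By Proposition 2.2

$$\varepsilon^{-1}(\alpha_t^{\psi,\varepsilon}-\alpha)\xrightarrow{w}N(\tilde b(\psi,h;\alpha),V(\psi;\alpha)),$$

where $\xrightarrow{w}$ denotes convergence in distribution (w.r.t. $P_\alpha^{\varepsilon,h}$ in our case), and $N(b,V)$ is a Gaussian random vector with mean $b$ and covariance matrix $V$.

But if $\xi=(\xi_1,\dots,\xi_m)'$ is a Gaussian vector with parameters $(\mu,\sigma^2)$, then

$$E|\xi|^2=\sum_{i=1}^mE\xi_i^2=\sum_{i=1}^m(E\xi_i)^2+\sum_{i=1}^mD\xi_i=|\mu|^2+\operatorname{tr}\sigma^2,$$

as was required.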
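
The identity $E|\xi|^2=|\mu|^2+\operatorname{tr}\sigma^2$ used above can be checked by simulation. The following is only an illustrative sketch: the mean vector, the covariance matrix and the helper name `mean_squared_norm` are hypothetical choices, not taken from the paper.

```python
import math
import random

def mean_squared_norm(mu, chol, n_samples, seed=0):
    """Monte Carlo estimate of E|xi|^2 for xi = mu + chol * z, z ~ N(0, I)."""
    rng = random.Random(seed)
    m = len(mu)
    total = 0.0
    for _ in range(n_samples):
        z = [rng.gauss(0.0, 1.0) for _ in range(m)]
        # chol is lower triangular, so only columns j <= i contribute.
        xi = [mu[i] + sum(chol[i][j] * z[j] for j in range(i + 1)) for i in range(m)]
        total += sum(x * x for x in xi)
    return total / n_samples

# Hypothetical parameters: mean mu and covariance sigma2 = [[2.0, 0.5], [0.5, 1.0]].
mu = [1.0, -2.0]
l11 = math.sqrt(2.0)
l21 = 0.5 / l11
l22 = math.sqrt(1.0 - l21 ** 2)
chol = [[l11, 0.0], [l21, l22]]          # Cholesky factor of sigma2

exact = sum(x * x for x in mu) + (2.0 + 1.0)   # |mu|^2 + tr sigma2
estimate = mean_squared_norm(mu, chol, n_samples=200_000)
```

The sample average `estimate` should be close to `exact`, up to Monte Carlo error.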

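Connect with each $\psi\in\Psi_0$ the function $\tilde\psi$ as follows:

$$\tilde\psi(s,x;\alpha)=[\gamma_0^\psi(\alpha)]^{-1}\psi(s,x;\alpha),\quad 0\le s\le t,\ x\in C_t,\ \alpha\in A.$$

Then $\tilde\psi\in\Psi_0$ and

$$\gamma_0^{\tilde\psi}(\alpha)=Id,$$

where $Id$ is the unit matrix,

$$V(\psi;\alpha)=V(\tilde\psi;\alpha)=\Gamma_0^{\tilde\psi}(\alpha),\qquad\tilde b(\psi,h;\alpha)=\tilde b(\tilde\psi,h;\alpha)=b(\tilde\psi,h;\alpha).$$

Therefore

$$D(\psi,h;\alpha)=D(\tilde\psi,h;\alpha)=|b(\tilde\psi,h;\alpha)|^2+\operatorname{tr}\Gamma_0^{\tilde\psi}(\alpha).\tag{2.24}$$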
Denote by $\mathcal H_r$ the set of functions $h\in\mathcal H$ such that for each $\alpha\in A$

$$\int_0^t|h(s,Y^0(\alpha);\alpha)|\,ds\le r,$$

where $r$, $r>0$, is a constant.
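Since, for each $r>0$,

$$\sup_{h\in\mathcal H_r}|b(\tilde\psi,h;\alpha)|\le\mathrm{const}(r)\sup_{0\le s\le t}|\tilde\psi(s,Y^0(\alpha);\alpha)|,$$

where the constant depends on $r$, we call the function $\tilde\psi$ an influence function of the estimate $(\alpha_t^{\psi,\varepsilon})_{\varepsilon>0}$, and the quantity

$$\gamma_\psi^*(\alpha)=\sup_{0\le s\le t}|\tilde\psi(s,Y^0(\alpha);\alpha)|$$

is called the (unstandardized) gross error sensitivity at the point $\alpha$ of the estimate $(\alpha_t^{\psi,\varepsilon})_{\varepsilon>0}$.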
Define

$$\Psi_{0,c}=\Big\{\psi\in\Psi_0:\ \int_0^t\psi(s,Y^0(\alpha);\alpha)[\dot a(s,Y^0(\alpha);\alpha)]'\,ds=Id,\tag{2.25}$$

$$\gamma_\psi^*(\alpha)\le c\Big\},\tag{2.26}$$

where $c\in[0,\infty)$ is a generic constant.

Taking into account the expression (2.24) for the risk functional, we come to the following optimization problem, known in robust estimation theory as Hampel's optimization problem: minimize the trace of the asymptotic covariance matrix of the estimate $(\alpha_t^{\psi,\varepsilon})_{\varepsilon>0}$ over the class $\Psi_{0,c}$, that is

$$\text{minimize}\quad\operatorname{tr}\int_0^t\psi(s,Y^0(\alpha);\alpha)[\psi(s,Y^0(\alpha);\alpha)]'\,ds\tag{2.27}$$

under the side conditions (2.25) and (2.26).

Define the Huber function $h_c(z)$, $z\in R^m$, $c>0$, as follows:

$$h_c(z):=z\min\Big(1,\frac{c}{|z|}\Big).$$

For an arbitrary nondegenerate matrix $A$ denote $\psi_c^A=h_c(A\dot a)$.
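
As an illustration of the definition, here is a minimal sketch of the vector Huber function in plain Python. The function names and the convention $h_c(0)=0$ are choices made for this example.

```python
import math

def euclid(z):
    """Euclidean norm |z| of a vector given as a list of floats."""
    return math.sqrt(sum(x * x for x in z))

def huber(z, c):
    """Huber function h_c(z) = z * min(1, c/|z|), with h_c(0) = 0 by convention."""
    size = euclid(z)
    if size <= c:
        return list(z)                 # inside the ball: z is left unchanged
    return [x * c / size for x in z]   # outside: rescaled onto the sphere |y| = c

inside = huber([0.3, 0.4], 1.0)    # |z| = 0.5 <= c, so z is returned as is
clipped = huber([3.0, 4.0], 1.0)   # |z| = 5 > c, so z is scaled by c/|z| = 0.2
```

In other words, $h_c$ is the Euclidean projection of $z$ onto the closed ball of radius $c$, which is the property the optimality argument relies on.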
Theorem 2.1. Assume that for a given constant $c$ there exists a nondegenerate $m\times m$-matrix $A_c^*(\alpha)$, which solves the equation (w.r.t. the matrix $A$)

$$\int_0^t\psi_c^A(s,Y^0(\alpha);\alpha)[\dot a(s,Y^0(\alpha);\alpha)]'\,ds=Id.\tag{2.28}$$

Then the function $\psi_c^*=\psi_c^{A_c^*(\alpha)}=h_c(A_c^*(\alpha)\dot a)$ solves the optimization problem (2.27).
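
In the scalar case $m=1$, equation (2.28) can be solved by bisection, since the left-hand side is nondecreasing in $A\ge0$. The sketch below is only an illustration under stated assumptions: the derivative $\dot a$ along $Y^0$ is replaced by a hypothetical function `a_dot(s) = 1 + s` on $[0,1]$, the integral is approximated by the midpoint rule, and the names `moment` and `solve_A` are invented for this example.

```python
def huber(z, c):
    """Scalar Huber function h_c(z) = z * min(1, c/|z|), i.e. clipping to [-c, c]."""
    return max(-c, min(c, z))

def moment(A, a_dot, c, t=1.0, n=2000):
    """Midpoint-rule value of int_0^t h_c(A * a_dot(s)) * a_dot(s) ds."""
    ds = t / n
    total = 0.0
    for i in range(n):
        d = a_dot((i + 0.5) * ds)
        total += huber(A * d, c) * d
    return total * ds

def solve_A(a_dot, c, t=1.0, upper=1e6, tol=1e-10):
    """Bisection for A >= 0 with moment(A) = 1 (scalar version of (2.28))."""
    if moment(upper, a_dot, c, t) < 1.0:
        raise ValueError("no solution: c is too small for this a_dot")
    lo, hi = 0.0, upper
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if moment(mid, a_dot, c, t) < 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

a_dot = lambda s: 1.0 + s   # hypothetical derivative of the drift along Y^0
c = 0.8                     # hypothetical truncation level
A_star = solve_A(a_dot, c)
```

Because $|h_c|\le c$, the left-hand side of (2.28) is bounded by $c\int_0^t|\dot a|\,ds$ in this scalar setting, so a solution can exist only when $c$ is large enough; the sketch raises an error otherwise.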



Proof. We follow Hampel et al. [12].
Let $A$ be an arbitrary $m\times m$-matrix.

Since for each $\psi\in\Psi_{0,c}$ we have $\int\psi[\dot a]'=Id$ and $\int\dot a[\dot a]'=I^0(\alpha)$ (see (2.7)), then

$$\int(\psi-A\dot a)(\psi-A\dot a)'=\int\psi\psi'-A-A'+AI^0(\alpha)A'$$

(here and below we use simple evident notation for integrals).

Therefore, since the trace is an additive functional, instead of minimizing $\operatorname{tr}\int\psi\psi'$ we can minimize

$$\operatorname{tr}\int(\psi-A\dot a)(\psi-A\dot a)'=\int|\psi-A\dot a|^2.$$

Note that for each $z$

$$\arg\min_{|y|\le c}|z-y|^2=h_c(z).$$

Indeed, it is evident that the minimizing $y$ has the form $y=\beta z$, where $\beta$, $0\le\beta\le1$, is a constant. Then

$$\min_{|y|\le c}|z-y|^2=\min_{\beta\le c/|z|}(1-\beta)^2|z|^2.$$

Thus we have to find

$$\arg\min_{\beta\le c/|z|}(1-\beta)^2=\min\Big(1,\frac{c}{|z|}\Big).$$

But the last relation is trivially satisfied. Hence the minimizer is $y^*=z\min\big(1,\frac{c}{|z|}\big)$ and

$$\arg\min_{|\psi|\le c}|\psi-A\dot a|^2=h_c(A\dot a).$$

On the other hand,

$$|h_c(z)|^2=|z|^2I_{\{|z|\le c\}}+c^2I_{\{|z|>c\}}\le c^2.$$

Hence

$$|h_c(z)|\le c\quad\text{for all }z,$$

and therefore $h_c(A\dot a)$ satisfies the condition (2.26) for each $A$.

Now it is evident that the function $h_c(A\dot a)$ minimizes the expression under the integral sign, and hence the integral itself, over all functions $\psi\in\Psi_0$ satisfying (2.26).
At the same time the condition (2.25), generally speaking, can be violated. But, since the matrix $A$ is arbitrary, we can choose $A=A_c^*(\alpha)$ from (2.28), which, of course, guarantees the validity of (2.25) for $\psi_c^*=\psi_c^{A_c^*(\alpha)}$. □

As we have seen, the resulting optimal influence function $\psi_c^*$ is defined along the process $Y^0(\alpha)=(Y_s^0(\alpha))_{0\le s\le t}$, which is a solution of equation (2.2).

But for constructing the optimal estimate we need a function $\psi_c^*(s,x;\alpha)$ defined on the whole space $[0,t]\times C_t\times A$.

For this purpose define $\psi_c^*(s,x;\alpha)$ as follows:

$$\psi_c^*(s,x;\alpha)=\psi_c^{A_c^*(\alpha)}(s,x;\alpha)=h_c(A_c^*(\alpha)\dot a(s,x;\alpha)),\tag{2.29}$$

and as usual $\psi_{c,\varepsilon}^*(s,x;\alpha)=\frac1\varepsilon\psi_c^*(s,\varepsilon x;\alpha)$, $0\le s\le t$, $x\in C_t$, $\alpha\in A$.

Definition 2.2. We say that $\psi_c^*(s,x;\alpha)$, $0\le s\le t$, $x\in C_t$, $\alpha\in A$, is an influence function of the optimal B-robust estimate $(\alpha_t^{*,\varepsilon})_{\varepsilon>0}=(\alpha_t^{\psi_c^*,\varepsilon})_{\varepsilon>0}$ over the class of CULAN estimates $(\alpha_t^{\psi,\varepsilon})_{\varepsilon>0}$, $\psi\in\Psi_{0,c}$, if the matrix $A_c^*(\alpha)$ is differentiable in $\alpha$.

From (J23D, (EUD, and ([ZggD it directly follows that 

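$$\gamma_0^{\psi_c^*}(\alpha)=P_\alpha^\varepsilon\text{-}\lim_{\varepsilon\to0}\varepsilon^2\gamma_t^{\psi_c^*,\varepsilon}(\alpha)=\int_0^t\psi_c^*(s,Y^0(\alpha);\alpha)(\dot a(s,Y^0(\alpha);\alpha))'\,ds=Id.$$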

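Besides, for each alternative $(P_\alpha^{\varepsilon,h})_{\varepsilon>0}$, $h\in\mathcal H$, according to Proposition 2.2 we have

$$\mathcal L\big\{\varepsilon^{-1}(\alpha_t^{*,\varepsilon}-\alpha)\mid P_\alpha^{\varepsilon,h}\big\}\xrightarrow{w}N(b(\psi_c^*,h;\alpha),V(\psi_c^*;\alpha))\quad\text{as }\varepsilon\to0,$$

where

$$b(\psi_c^*,h;\alpha)=\int_0^t\psi_c^*(s,Y^0(\alpha);\alpha)\,h(s,Y^0(\alpha);\alpha)\,ds,$$

and $V(\psi_c^*;\alpha)=\Gamma_0^{\psi_c^*}(\alpha)$.

Hence, the risk functional for the estimate $(\alpha_t^{*,\varepsilon})_{\varepsilon>0}$ is

$$D(\psi_c^*,h;\alpha)=|b(\psi_c^*,h;\alpha)|^2+\operatorname{tr}\Gamma_0^{\psi_c^*}(\alpha),\quad h\in\mathcal H,$$

and the (unstandardized) gross error sensitivity of $(\alpha_t^{*,\varepsilon})_{\varepsilon>0}$ is

$$\gamma_{\psi_c^*}^*(\alpha)=\sup_{0\le s\le t}|\psi_c^*(s,Y^0(\alpha);\alpha)|\le c.$$

From the above reasoning we may conclude that $(\alpha_t^{*,\varepsilon})_{\varepsilon>0}$ is the optimal B-robust estimate over the class of estimates $(\alpha_t^{\psi,\varepsilon})_{\varepsilon>0}$, $\psi\in\Psi_{0,c}$, in the following sense: the trace of the asymptotic covariance matrix of $(\alpha_t^{*,\varepsilon})_{\varepsilon>0}$ is minimal among all estimates $(\alpha_t^{\psi,\varepsilon})_{\varepsilon>0}$ with gross error sensitivity bounded by the constant $c$, that is

$$\operatorname{tr}\Gamma_0^{\psi_c^*}(\alpha)\le\operatorname{tr}\Gamma_0^{\psi}(\alpha)\quad\text{for all }\psi\in\Psi_{0,c}.\ □$$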
Note that for each estimate $(\alpha_t^{\psi,\varepsilon})_{\varepsilon>0}$ and alternatives $(P_\alpha^{\varepsilon,h})_{\varepsilon>0}$, $h\in\mathcal H$, the influence functional is bounded by $\mathrm{const}(r)\cdot c$. Indeed, we have for $\psi\in\Psi_{0,c}$

$$\sup_{h\in\mathcal H_r}|b(\psi,h;\alpha)|\le\mathrm{const}(r)\cdot c:=C(r,c),$$

and since from (2.24)

$$\inf_{\psi\in\Psi_{0,c}}\sup_{h\in\mathcal H_r}D(\psi,h;\alpha)\le C^2(r,c)+\operatorname{tr}\Gamma_0^{\psi_c^*}(\alpha),$$

we can choose the "optimal level" of truncation by minimizing the expression

$$C^2(r,c)+\operatorname{tr}\Gamma_0^{\psi_c^*}(\alpha)$$

over all constants $c$ for which the equation (2.28) has a solution $A_c^*(\alpha)$. This can be done using numerical methods.

For the problem of existence and uniqueness of a solution of equation (2.28) we refer to Rieder [34].
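
The numerical choice of the truncation level can be sketched as a grid search. Everything below is an illustrative assumption rather than the authors' procedure: the scalar case $m=1$, a hypothetical derivative $\dot a(s)=1+s$ on $[0,1]$, the crude bound $\mathrm{const}(r)=r$ (which follows from $|\int\psi h\,ds|\le\sup|\psi|\int|h|\,ds$), and a midpoint-rule discretization.

```python
def huber(z, c):
    """Scalar Huber function: clipping of z to [-c, c]."""
    return max(-c, min(c, z))

def integrals(A, c, grid, ds):
    """Return (int h_c(A d) d ds, int h_c(A d)^2 ds) on a midpoint grid of a_dot values."""
    first = sum(huber(A * d, c) * d for d in grid) * ds
    second = sum(huber(A * d, c) ** 2 for d in grid) * ds
    return first, second

def solve_A(c, grid, ds, upper=100.0, tol=1e-9):
    """Bisection for the scalar version of (2.28); None if no solution below `upper`."""
    if integrals(upper, c, grid, ds)[0] < 1.0:
        return None
    lo, hi = 0.0, upper
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if integrals(mid, c, grid, ds)[0] < 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def best_truncation(a_dot, r, candidates, t=1.0, n=400):
    """Minimize (r*c)^2 + int (psi_c^*)^2 ds over the feasible candidate levels c."""
    ds = t / n
    grid = [a_dot((i + 0.5) * ds) for i in range(n)]
    best = None
    for c in candidates:
        A = solve_A(c, grid, ds)
        if A is None:
            continue                      # (2.28) has no solution for this c
        bound = (r * c) ** 2 + integrals(A, c, grid, ds)[1]
        if best is None or bound < best[1]:
            best = (c, bound, A)
    return best

candidates = [0.60 + 0.01 * i for i in range(41)]   # levels 0.60, 0.61, ..., 1.00
c_star, bound, A_star = best_truncation(lambda s: 1.0 + s, r=0.2, candidates=candidates)
```

Small values of `c` are skipped because the scalar equation has no solution there; among the remaining levels the sketch trades the bias bound $(rc)^2$ against the variance term.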

In the case of a one-dimensional parameter $\alpha$ (i.e. $m=1$) the optimal level $c^*$ of truncation is given as the unique solution of the following equation (see Lazrieva and Toronjadze [20], [21])




o 



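where $[x]_a^b=(x\wedge b)\vee a$, and the resulting function

$$\psi^*(s,x;\alpha)=[\dot a(s,x;\alpha)]_{-c^*}^{c^*},\quad 0\le s\le t,\ x\in C_t,$$

is $(\Psi_0,\mathcal H_r)$ optimal in the following minimax sense:

$$\sup_{h\in\mathcal H_r}D(\psi^*,h;\alpha)=\inf_{\psi\in\Psi_0}\sup_{h\in\mathcal H_r}D(\psi,h;\alpha).$$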

3. Optimal Mean-Variance Robust Hedging

3.1. A financial market model. Let $(\Omega,\mathcal F,F=(\mathcal F_t)_{0\le t\le T},P)$ be a filtered probability space with filtration $F$ satisfying the usual conditions, where $T\in(0,\infty]$ is a fixed time horizon. Assume that $\mathcal F_0$ is trivial and $\mathcal F_T=\mathcal F$.

There exist $d+1$, $d\ge1$, primitive assets: one bond, whose price process is assumed to be 1 at all times, and $d$ risky assets (stocks), whose $R^d$-valued price process $X=(X_t)_{0\le t\le T}$ is a continuous semimartingale given by the relation

$$dX_t=\operatorname{diag}(X_t)\,dR_t,\quad X_0>0,\tag{3.1}$$

where $\operatorname{diag}(X)$ denotes the diagonal $d\times d$-matrix with diagonal elements $X^1,\dots,X^d$, and the yield process $R=(R_t)_{0\le t\le T}$ is an $R^d$-valued continuous semimartingale satisfying the structure condition (SC). That is (see Schweizer)

$$dR_t=d\langle\widetilde M\rangle_t\lambda_t+d\widetilde M_t,\quad R_0=0,\tag{3.2}$$
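where $\widetilde M=(\widetilde M_t)_{0\le t\le T}$ is an $R^d$-valued continuous martingale, $\widetilde M\in\mathcal M_{0,loc}^2(P)$, $\lambda=(\lambda_t)_{0\le t\le T}$ is an $F$-predictable $R^d$-valued process, and the mean-variance tradeoff (MVT) process $\widetilde{\mathcal K}=(\widetilde{\mathcal K}_t)_{0\le t\le T}$ of the process $R$ satisfies

$$\widetilde{\mathcal K}_t:=\int_0^t\lambda_s'\,d\langle\widetilde M\rangle_s\lambda_s=\langle\lambda'\cdot\widetilde M\rangle_t<\infty\quad P\text{-a.s.},\ t\in[0,T].\tag{3.3}$$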

Remark 3.1. Remember that all vectors are assumed to be column vectors.
Suppose that the martingale $\widetilde M$ has the form

$$\widetilde M=\sigma\cdot M,\tag{3.4}$$

where $M=(M_t)_{0\le t\le T}$ is an $R^d$-valued continuous martingale, $M\in\mathcal M_{0,loc}^2(P)$ with $d\langle M^i,M^j\rangle_t=I_{d\times d}^{ij}\,dC_t$, $I_{d\times d}$ is the identity matrix, and $C=(C_t)_{0\le t\le T}$ is a continuous increasing bounded process with $C_0=0$.

Further, let $\sigma=(\sigma_t)_{0\le t\le T}$ be a $d\times d$-matrix valued, $F$-predictable process with $\operatorname{rank}(\sigma_t)=d$ for any $t$, $P$-a.s., such that the process $(\sigma_t^{-1})_{0\le t\le T}$ is locally bounded and

$$\int_0^T\sigma_t\,d\langle M\rangle_t\,\sigma_t'<\infty\quad P\text{-a.s.}\tag{3.5}$$



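Assume now that the following condition is satisfied:

There exists a fixed $R^d$-valued, $F$-predictable process $k=(k_t)_{0\le t\le T}$ such that

$$\lambda=\lambda(\sigma)=(\sigma')^{-1}k.\tag{3.6}$$

In this case from (3.2) we get

$$dR_t=d\langle\widetilde M\rangle_t\lambda_t+d\widetilde M_t=\sigma_t\,d\langle M\rangle_t\,\sigma_t'(\sigma_t')^{-1}k_t+\sigma_t\,dM_t=\sigma_t\big(d\langle M\rangle_tk_t+dM_t\big),\tag{3.7}$$

and

$$\widetilde{\mathcal K}_t=\int_0^t\lambda_s'\,d\langle\widetilde M\rangle_s\lambda_s=\int_0^tk_s'(\sigma_s)^{-1}\sigma_s\,d\langle M\rangle_s\,\sigma_s'(\sigma_s')^{-1}k_s=\int_0^tk_s'\,d\langle M\rangle_sk_s=\langle k'\cdot M\rangle_t:=\mathcal K_t.$$

From (3.3) we have

$$\mathcal K_t<\infty\quad P\text{-a.s. for all }t\in[0,T].\tag{3.8}$$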
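
The cancellation of $\sigma$ in the MVT process can be checked pointwise. The sketch below assumes $d=2$ and $d\langle M\rangle_t=I\,dC_t$, so that $d\langle\sigma\cdot M\rangle_t=\sigma_t\sigma_t'\,dC_t$, and verifies $\lambda'\sigma\sigma'\lambda=k'k$ for $\lambda=(\sigma')^{-1}k$; the matrix `sigma`, the vector `k` and the helper names are hypothetical.

```python
def transpose(a):
    """Transpose of a 2x2 matrix."""
    return [[a[0][0], a[1][0]], [a[0][1], a[1][1]]]

def matmul(a, b):
    """Product of two 2x2 matrices."""
    return [[sum(a[i][l] * b[l][j] for l in range(2)) for j in range(2)] for i in range(2)]

def solve2(a, b):
    """Solve the 2x2 linear system a x = b by Cramer's rule."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [(b[0] * a[1][1] - a[0][1] * b[1]) / det,
            (a[0][0] * b[1] - a[1][0] * b[0]) / det]

sigma = [[2.0, 1.0], [0.5, 3.0]]      # hypothetical nondegenerate volatility matrix
k = [0.3, -0.7]                       # hypothetical value of the process k
lam = solve2(transpose(sigma), k)     # lambda = (sigma')^{-1} k, as in (3.6)

gram = matmul(sigma, transpose(sigma))                        # sigma sigma'
lhs = sum(lam[i] * gram[i][j] * lam[j] for i in range(2) for j in range(2))
rhs = sum(x * x for x in k)           # k'k, the density of <k'.M> w.r.t. dC
```

The two densities `lhs` and `rhs` coincide up to rounding, which is the pointwise form of the chain of equalities preceding (3.8).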

Thus, if we introduce the process $M^0=(M_t^0)_{0\le t\le T}$ by the relation

$$dM_t^0=d\langle M\rangle_tk_t+dM_t,\quad M_0^0=0,\tag{3.9}$$

then the MVT process $\mathcal K=(\mathcal K_t)_{0\le t\le T}$ of the $R^d$-valued semimartingale $M^0$ is finite, and hence $M^0$ satisfies SC.

Finally, the scheme introduced above can be rewritten in the following form:

$$\begin{aligned}dX_t&=\operatorname{diag}(X_t)\,dR_t,&X_0&>0,\\dR_t&=\sigma_t\,dM_t^0,&R_0&=0,\\dM_t^0&=d\langle M\rangle_tk_t+dM_t,&M_0^0&=0,\end{aligned}\tag{3.10}$$

where $\sigma$ and $k$ satisfy (3.5) and (3.8), respectively.
This is our financial market model.

3.2. Characterization of variance-optimal ELMM (equivalent local martingale measure). A key role in mean-variance hedging is played by the variance-optimal ELMM (see, e.g., RSch [33], GLP [11]). Here we collect some facts characterizing this measure.

We start with the remark that the sets of ELMMs for the processes $X$, $R$ and $M^0$ from (3.10) coincide. Hence we can and will consider the simplest process $M^0$.

Introduce the notation

$$\mathcal M_2^e:=\Big\{Q\sim P:\ \frac{dQ}{dP}\in L^2(P),\ M^0\text{ is a }Q\text{-local martingale}\Big\},$$

and suppose that

(c.1) $\mathcal M_2^e\ne\emptyset$.

The solution $\tilde P$ of the optimization problem

$$E\,\mathcal E_T^2(M^Q)\to\inf_{Q\in\mathcal M_2^e}$$

is called the variance-optimal ELMM.
Here

$$\frac{dQ}{dP}=\mathcal E_T(M^Q),$$

and $(\mathcal E_t(M^Q))_{0\le t\le T}$ is the Doléans exponential of the martingale $M^Q$.

It is well-known (see, e.g., Schweizer [37], [38]) that under condition (c.1) the variance-optimal ELMM $\tilde P$ exists.

Denote

$$\tilde z_T:=\frac{d\tilde P}{dP}$$

and introduce the RCLL process $\tilde z=(\tilde z_t)_{0\le t\le T}$ by the relation

$$\tilde z_t=E^{\tilde P}(\tilde z_T\mid\mathcal F_t),\quad 0\le t\le T.$$

Then, by Schweizer,

$$\tilde z_t=\tilde z_0+\int_0^t\zeta_s'\,dM_s^0,\tag{3.11}$$

where $\zeta=(\zeta_t)_{0\le t\le T}$ is an $R^d$-valued $F$-predictable process with

$$\int_0^T\zeta_t'\,d\langle M\rangle_t\,\zeta_t<\infty,$$

and the process $\big(\int_0^t\zeta_s'\,dM_s^0\big)_{0\le t\le T}$ is a $\tilde P$-martingale.

Relation (3.11) easily implies that the process $\tilde z$ is actually continuous.
Suppose, in addition to (c.1), that the following condition is satisfied:

(c.*) all $P$-local martingales are continuous.

This technical assumption is satisfied in stochastic volatility models, where $F=F^w$ is the natural filtration generated by the Wiener process.

It is shown in Mania and Tevzadze [31], Mania et al. [25] that under conditions (c.1) and (c.*) the density $\tilde z_T$ of the variance-optimal ELMM is uniquely characterized by the relation

$$\tilde z_T=\frac{\mathcal E_T((\varphi-k)'\cdot M^0)}{E\,\mathcal E_T((\varphi-k)'\cdot M^0)},\tag{3.12}$$

where $\varphi$ together with the pair $(L,c)$ is the unique solution of the following equation

$$\mathcal E_T((\varphi-k)'\cdot M^0)=c\,\mathcal E_T(L-k'\cdot M),\tag{3.13}$$

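where $L\in\mathcal M_{0,loc}^2(P)$, $\langle L,M\rangle=0$, and $c$ is a constant.

Moreover, the process $\zeta=(\zeta_t)_{0\le t\le T}$ from (3.11) has the form

$$\zeta_t=(\varphi_t-k_t)\,\mathcal E_t((\varphi-k)'\cdot M^0).\tag{3.14}$$

Here $\varphi=(\varphi_t)_{0\le t\le T}$ is an $R^d$-valued, $F$-predictable process with

$$\int_0^T\varphi_t'\,d\langle M\rangle_t\,\varphi_t<\infty.$$

Let $\tau$ be an $F$-stopping time.

Denote $\langle k'\cdot M\rangle_{T\tau}=\langle k'\cdot M\rangle_T-\langle k'\cdot M\rangle_\tau$.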

Proposition 3.1 (see also Biagini et al. [3], Laurent and Pham [18]).

1. Equation (3.13) is equivalent to the equation

^p. = ^, (3.15) 

where the $R^d$-valued process $M^*=(M_t^*)_{0\le t\le T}$ is given by the relation

$$dM_t^*=2\,d\langle M\rangle_tk_t+dM_t,\quad M_0^*=0.$$

2. a) If there exists a martingale $m=(m_t)_{0\le t\le T}$, $m\in\mathcal M_{0,loc}^2(P)$, such that

$$e^{-\langle k'\cdot M\rangle_T}=c+m_T,\quad\langle m,M\rangle=0,\tag{3.16}$$

then $\varphi=0$ and $L=\int_0^{\cdot}\frac{1}{c+m_s}\,dm_s$ solve the equation (3.15).



In this case 

£r(-k' ■ M°) 
E&r(-k' ■ M°) 
process ( = (Ct)o<t<T from (Iff. lip is equal to 

Ct = -h£ t (-k' ■ M°) 

and 



(3.17) 



E 



5] iTr 



1 



E ( e -(k>-M) TT /JT ) ' 



b) If there exist R d -valued F -predictable process t = (it)o<t<T, J i' t d(M)£ t < 

o 

oo and 

T 

e {k'-M) T=c+ J ftdM ^ 


then L = and cp t = c+ jt^, dM , solve the equation (\3.15§ . 
In this case 

z T = £ T (—k' ■ M) (:= zr, the density of minimal martingale measure P), 
and 

z T 

where dP* = S T {-2k' ■ M)dP . 
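Proof. 1. By the Yor formula

$$\mathcal E_T((\varphi-2k)'\cdot M)=\mathcal E_T(\varphi'\cdot M-2k'\cdot M)=\mathcal E_T\Big(\varphi'\cdot\Big[M+2\int d\langle M\rangle_tk_t\Big]-2\int\varphi_t'\,d\langle M\rangle_tk_t-2k'\cdot M\Big)=\mathcal E_T(\varphi'\cdot M^*)\,\mathcal E_T(-2k'\cdot M),$$

and

$$\mathcal E_T^2(-k'\cdot M)=\mathcal E_T(-2k'\cdot M)\,e^{\langle k'\cdot M\rangle_T}.$$

The assertion follows.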

2. a) Note first that $\langle L,M\rangle=0$. Further, by the Itô formula we can write

$$\ln(c+m_t)-\ln c=\int_0^t\frac{dm_s}{c+m_s}-\frac12\int_0^t\frac{d\langle m\rangle_s}{(c+m_s)^2}.$$

Hence

$$e^{\ln(c+m_T)-\ln c}=\mathcal E_T(L),$$

and thus

$$\mathcal E_T(L)=\frac{c+m_T}{c}=\frac{e^{-\langle k'\cdot M\rangle_T}}{c}.$$

Finally, by the Bayes rule and the Girsanov Theorem

2 




E 2 {£ T {-k> ■ M)e-( k '- M )T/F T ) 
E*(c + m T /J 7 r)S^(-k' ■ M) c + m T „ {k >. M ) T 



{E{c + m T /F T )) 2 £ 2 {-2k' ■ M) (c + m T ) 2 

1 



E(e(k' ■ M)tt/Ft) ' 
The proof of case 2 b) is quite analogous. 

Proposition is proved. □ 

3.3. Misspecified asset price model and robust hedging. Denote by $\operatorname{Ball}_L(0,r)$, $r\in[0,\infty)$, the closed ball of radius $r$ in the space $L=L_\infty(dt\times dP)$, with the center at the origin, and let

$$\mathcal H:=\big\{h=\{h_{ij}\},\ i,j=1,\dots,d:\ h\text{ is an }F\text{-predictable }d\times d\text{-matrix valued process},\ \operatorname{rank}(h)=d,\ h_{ij}\in\operatorname{Ball}_L(0,r),\ r\in[0,\infty)\big\}.\tag{3.18}$$

The class $\mathcal H$ is called the class of alternatives.

Fix the value of a small parameter $\delta>0$, as well as a $d\times d$-matrix valued, $F$-predictable process $\sigma^0=(\sigma_t^0)_{0\le t\le T}=(\{\sigma_{ij,t}^0\},\ 1\le i,j\le d)_t$ such that $|\sigma_{ij,t}^0|\le\mathrm{const}$, $\forall i,j,t$, and the matrix $(\sigma^0)^2=\sigma^0(\sigma^0)'$ is uniformly elliptic, i.e. for each vector $v_t=(v_t^1,\dots,v_t^d)$ with probability 1

$$\sum_{i,j=1}^d(\sigma_t^0)_{ij}^2v_t^iv_t^j\ge c\sum_{i=1}^d|v_t^i|^2,\quad c>0,\ 0\le t\le T,\tag{3.19}$$

and denote

$$A_\delta=\{\sigma:\ \sigma=\sigma^0+\delta h,\ h\in\mathcal H\}.\tag{3.20}$$

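Proposition 3.2. Every $\sigma$ from the class $A_\delta$ for sufficiently small $\delta$ is an $F$-predictable $d\times d$-matrix valued process with bounded elements, and the matrix $\sigma^2=\sigma\sigma'$ is uniformly elliptic.

Proof. The process $\sigma$ is $F$-predictable as a linear combination of $F$-predictable processes. Further,

$$|\sigma_{ij,t}|=|\sigma_{ij,t}^0+\delta h_{ij,t}|\le\mathrm{const}+\delta r,\quad 0<\delta<1.$$

From (3.19) and (3.20) for each vector $v_t=(v_t^1,\dots,v_t^d)$ we have

$$\sum_{i,j=1}^d(\sigma_t^2)_{ij}v_t^iv_t^j=\sum_{i,j=1}^d\big((\sigma_t^0+\delta h_t)(\sigma_t^0+\delta h_t)'\big)_{ij}v_t^iv_t^j$$

$$=\sum_{i,j=1}^d(\sigma_t^0(\sigma_t^0)')_{ij}v_t^iv_t^j+\delta\sum_{i,j=1}^d(\sigma_t^0h_t')_{ij}v_t^iv_t^j+\delta\sum_{i,j=1}^d(h_t(\sigma_t^0)')_{ij}v_t^iv_t^j+\delta^2\sum_{i,j=1}^d(h_th_t')_{ij}v_t^iv_t^j.\tag{3.21}$$

Note now that the elements of the matrices $\sigma^0$ and $h$ are bounded. Hence, choosing $\delta$ sufficiently small, we get

$$\max\big(\delta|(\sigma^0h')_{ij,t}|,\ \delta|(h(\sigma^0)')_{ij,t}|,\ \delta^2|(hh')_{ij,t}|\big)\le\varepsilon.$$

Therefore from (3.19) and (3.21) we get

$$\sum_{i,j=1}^d(\sigma_t^2)_{ij}v_t^iv_t^j\ge(c-\mathrm{const}\cdot\varepsilon)\sum_{i=1}^d|v_t^i|^2\quad\text{for each }\varepsilon>0.$$

The proposition is proved. □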
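
The stability of uniform ellipticity under a small perturbation can be illustrated numerically for $d=2$. This is only a sketch: the matrices `sigma0` and `h`, the value of `delta`, and the Frobenius-norm bound on the loss of the smallest eigenvalue are choices made here, not part of the paper's proof.

```python
import math

def gram(s):
    """Return s s' for a 2x2 matrix s."""
    return [[s[i][0] * s[j][0] + s[i][1] * s[j][1] for j in range(2)] for i in range(2)]

def min_eig_sym(m):
    """Smallest eigenvalue of a symmetric 2x2 matrix."""
    half_trace = 0.5 * (m[0][0] + m[1][1])
    radius = math.hypot(0.5 * (m[0][0] - m[1][1]), m[0][1])
    return half_trace - radius

def frob(s):
    """Frobenius norm of a matrix given as nested lists."""
    return math.sqrt(sum(x * x for row in s for x in row))

sigma0 = [[1.0, 0.2], [0.1, 1.5]]   # hypothetical reference volatility
h = [[1.0, -1.0], [1.0, 1.0]]       # hypothetical alternative, entries bounded by r = 1
delta = 0.05
sigma = [[sigma0[i][j] + delta * h[i][j] for j in range(2)] for i in range(2)]

c0 = min_eig_sym(gram(sigma0))      # ellipticity constant of sigma0 sigma0'
c_delta = min_eig_sym(gram(sigma))  # ellipticity constant of sigma sigma'
# The smallest eigenvalue moves by at most the norm of the perturbation in (3.21).
loss_bound = 2.0 * delta * frob(sigma0) * frob(h) + delta ** 2 * frob(h) ** 2
```

For small `delta` the quantity `c0 - loss_bound` stays positive, so `sigma sigma'` remains uniformly elliptic with a slightly smaller constant, in line with Proposition 3.2.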

Consider the set of processes $\{R^\sigma$ (or $X^\sigma$), $\sigma\in A_\delta\}$, which represents the misspecified asset price model.

Define the class of admissible trading strategies $\Theta=\Theta(\sigma^0)$.

Proposition 3.3. For each $R^d$-valued $F$-predictable process $\theta=(\theta_t)_{0\le t\le T}$ and for each $\sigma\in A_\delta$, $\delta>0$,

$$a\,E\int_0^T|\theta_t|^2\,dC_t\le E\int_0^T\theta_t'\sigma_t\,d\langle M\rangle_t\,\sigma_t'\theta_t=E\int_0^T\theta_t'\sigma_t\sigma_t'\theta_t\,dC_t\le A\,E\int_0^T|\theta_t|^2\,dC_t,$$

where the constants $a$, $A$ are such that $0<a\le A<\infty$, and the parameter $\delta>0$ is sufficiently small.

Proof. Remember that $d\langle M\rangle_t=d\langle M^i,M^j\rangle_t=I_{d\times d}^{ij}\,dC_t$. Hence

$$E\int_0^T\theta_t'\sigma_t\,d\langle M\rangle_t\,\sigma_t'\theta_t=E\int_0^T\theta_t'\sigma_t\sigma_t'\theta_t\,dC_t.$$

Further, since $\sigma=\sigma^0+\delta h$ and the elements of the matrices $\sigma^0$ and $h$ are bounded, the same is true for the elements of the matrix $\sigma$ with $0<\delta\le\mathrm{const}$. Thus, using the inequality $ab\le\frac12(a^2+b^2)$, we get

$$E\int_0^T\theta_t'\sigma_t\sigma_t'\theta_t\,dC_t\le A\,E\int_0^T|\theta_t|^2\,dC_t.$$

On the other hand, by Proposition 3.2 the matrix $\sigma^2=\sigma\sigma'$ is uniformly elliptic for sufficiently small $\delta$, which yields the first inequality. □
Definition 3.1. The class $\Theta=\Theta(\sigma^0)$ is the class of $R^d$-valued $F$-predictable processes $\theta=(\theta_t)_{0\le t\le T}$ such that

$$E\int_0^T|\theta_t|^2\,dC_t<\infty.\tag{3.22}$$



Let $\theta\in\Theta$ be the dollar amount (rather than the number of shares) invested in the stock $X^\sigma$, $\sigma\in A_\delta$. Then for each $\sigma\in A_\delta$ the trading gains induced by the self-financing portfolio strategy associated to $\theta$ have the form

$$G_t(\sigma,\theta)=\int_0^t\theta_s'\,dR_s^\sigma,\quad 0\le t\le T,\tag{3.23}$$

where $R^\sigma=(R_t^\sigma)_{0\le t\le T}$ is the yield process given by (3.10).
Introduce the condition:

(c.2) There exists an ELMM $Q$ such that the density process $z=z^Q$ satisfies the reverse Hölder inequality $R_2(P)$, see the definition in RSch [33].

It is well-known that under the conditions (c.1) and (c.2) the density process $\tilde z=(\tilde z_t)_{0\le t\le T}$ of the variance-optimal ELMM satisfies $R_2(P)$ as well, see Dolean et al. [5].

Now under the conditions (c.1) and (c.2) the r.v. $G_T(\sigma,\theta)\in L^2(P)$, $\forall\sigma\in A_\delta$, and the space $G_T(\sigma,\Theta)$ is closed in $L^2(P)$, $\forall\sigma\in A_\delta$ (see, e.g., Theorem 2 of RSch [33]).

A contingent claim is an $\mathcal F_T$-measurable square-integrable r.v. $H$, which models the payoff from a financial product at the maturity date $T$.

The problem we are interested in is to find the robust hedging strategy for a contingent claim $H$ in the above described incomplete financial market model with misspecified asset price process $X^\sigma$, $\sigma\in A_\delta$, using the mean-variance approach.

For each $\sigma\in A_\delta$, the total loss of a hedger who starts with the initial capital $x$, uses the strategy $\theta$, believes that the stock price process follows $X^\sigma$, and has to pay a random amount $H$ at the date $T$, is $H-x-G_T(\sigma,\theta)$.

Denote

$$J(\sigma,\theta):=E\big(H-x-G_T(\sigma,\theta)\big)^2.\tag{3.24}$$

One setting of the robust mean-variance hedging problem consists in solving the optimization problem

$$\text{minimize}\quad\sup_{\sigma\in A_\delta}J(\sigma,\theta)\quad\text{over all strategies }\theta\in\Theta.\tag{3.25}$$

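We "slightly" change this problem using the approach developed in Toronjadze [JT], which is based on the following approximation:

$$\sup_{\sigma\in A_\delta}J(\sigma,\theta)=\exp\Big\{\sup_{h\in\mathcal H}\ln J(\sigma^0+\delta h,\theta)\Big\}\simeq\exp\Big\{\sup_{h\in\mathcal H}\Big[\ln J(\sigma^0,\theta)+\delta\frac{DJ(\sigma^0,h,\theta)}{J(\sigma^0,\theta)}\Big]\Big\}=J(\sigma^0,\theta)\exp\Big\{\delta\sup_{h\in\mathcal H}\frac{DJ(\sigma^0,h,\theta)}{J(\sigma^0,\theta)}\Big\},$$

where

$$DJ(\sigma^0,h,\theta):=\frac{d}{d\delta}J(\sigma^0+\delta h,\theta)\Big|_{\delta=0}=\lim_{\delta\to0}\frac{J(\sigma^0+\delta h,\theta)-J(\sigma^0,\theta)}{\delta}$$

is the Gateaux differential of the functional $J$ at the point $\sigma^0$ in the direction $h$.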

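Approximate (in leading order of $\delta$) the optimization problem (3.25) by the problem

$$\text{minimize}\quad J(\sigma^0,\theta)\exp\Big\{\delta\sup_{h\in\mathcal H}\frac{DJ(\sigma^0,h,\theta)}{J(\sigma^0,\theta)}\Big\}\quad\text{over all strategies }\theta\in\Theta.\tag{3.26}$$

Note that each solution $\theta^*$ of the problem (3.26) minimizes $J(\sigma^0,\theta)$ under the constraint

$$\sup_{h\in\mathcal H}\frac{DJ(\sigma^0,h,\theta)}{J(\sigma^0,\theta)}\le\sup_{h\in\mathcal H}\frac{DJ(\sigma^0,h,\theta^*)}{J(\sigma^0,\theta^*)}.$$

This characterization of an optimal strategy $\theta^*$ of the problem (3.26) leads to the following definition.

Definition 3.2. The trading strategy $\theta^*\in\Theta$ is called the optimal mean-variance robust trading strategy against the class of alternatives $\mathcal H$ if it is a solution of the optimization problem

$$\text{minimize}\quad J(\sigma^0,\theta)\quad\text{over all strategies }\theta\in\Theta,\text{ subject to the constraint}\quad\sup_{h\in\mathcal H}\frac{DJ(\sigma^0,h,\theta)}{J(\sigma^0,\theta)}\le c,\tag{3.27}$$

where $c$ is some generic constant.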

Remark 3.2. In contrast to the "mean-variance robust" trading strategy, which is associated with the optimization problem (3.25) and control theory, we find the "optimal mean-variance robust" strategy in the sense of Definition 3.2. Such an approach and term are common in robust statistics theory (see, e.g., Hampel et al. [12], Rieder [34]).

Does the suggested approach provide a "good" approximation? Consider the following case.

Diffusion model with zero drift. Let a standard Wiener process $w=(w_t)_{0\le t\le T}$ be given on the complete probability space $(\Omega,\mathcal F,P)$. Denote by $F^w=(\mathcal F_t^w,\ 0\le t\le T)$ the $P$-augmentation of the natural filtration $\mathcal F_t^{w,0}=\sigma(w_s,\ 0\le s\le t)$, $0\le t\le T$, generated by $w$.

Let the stock price process be modeled by the equation

$$dX_t^\sigma=X_t^\sigma\cdot\sigma_t\,dw_t,\quad X_0^\sigma>0,\ 0\le t\le T,$$




where $\sigma\in A_\delta$ with

$$\int_0^T(\sigma_t^0)^2\,dt<\infty$$

and $h\in\operatorname{Ball}_{L_\infty(dt\times dP)}(0,r)$, $0<r<\infty$. All considered processes are real-valued.

Denote by $R^\sigma$ the yield process, i.e.,

$$dR_t^\sigma=\sigma_t\,dw_t,\quad R_0^\sigma=0,\ 0\le t\le T.$$

The wealth at maturity $T$, with the initial endowment $x$, is equal to

$$V_T^{x,\theta}(\sigma)=x+\int_0^T\theta_t\,dR_t^\sigma.$$

Let, further, the contingent claim $H$ be an $\mathcal F_T^w$-measurable $P$-square-integrable r.v.

Consider the optimization problem (3.25). It is easy to see that if $\sigma\in A_\delta$, then

$$\sigma_t^0-\delta r\le\sigma_t\le\sigma_t^0+\delta r,\quad 0\le t\le T,\ P\text{-a.s.}$$

By the martingale representation theorem

$$H=EH+\int_0^T\varphi_t^H\,dw_t,\quad P\text{-a.s.},$$

where $\varphi^H$ is an $F^w$-predictable process with

$$E\int_0^T(\varphi_t^H)^2\,dt<\infty.\tag{3.28}$$

Hence

$$E\big(H-V_T^{x,\theta}(\sigma)\big)^2=(EH-x)^2+E\int_0^T(\varphi_t^H-\sigma_t\theta_t)^2\,dt.$$



From this it directly follows that the process 

oW) = {a%-8r)I n I {M o } 

{-t>°rs 
'° i- ur ) 1 s _ 

is a solution of the optimization problem 

, 2 



+ (of + 6r)I n 7 {M0} , < t < T, (3.29) 

i 9, <fT i I 



maximize $E\big(H-V_T^{x,\theta}(\sigma)\big)^2$ over all $\sigma\in A_\delta$ with a given $\theta\in\Theta$.
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It remains to minimize (w.r.t. 9) the expression 

T 



E J (rf - a;{9)6 t ) 2 dt. 



From (I3.29p it easily follows that the equation (w.r.t. 
has no solution, but 



e* t =%I w o m , 0<t<T, (3.30) 



solves problem. We assume that 0/0 := 0. 

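Consider now the optimization problem (3.27).
For each fixed $h$

$$J(\sigma,\theta)=E\Big(H-x-\int_0^T\theta_t\,dR_t^\sigma\Big)^2=E\Big(H-x-\int_0^T\theta_t\sigma_t^0\,dw_t-\delta\int_0^T\theta_th_t\,dw_t\Big)^2$$

$$=J(\sigma^0,\theta)-2\delta E\Big[\Big(EH-x+\int_0^T(\varphi_t^H-\theta_t\sigma_t^0)\,dw_t\Big)\int_0^T\theta_th_t\,dw_t\Big]+\delta^2E\int_0^T\theta_t^2h_t^2\,dt,$$

and hence

$$DJ(\sigma^0,h;\theta)=2E\int_0^T(\theta_t\sigma_t^0-\varphi_t^H)\theta_th_t\,dt,\tag{3.31}$$

as follows from (3.28), the definition of the class $\mathcal H$ and the estimate

$$\Big(E\int_0^T(\theta_t\sigma_t^0-\varphi_t^H)\theta_th_t\,dt\Big)^2\le E\int_0^T(\theta_t\sigma_t^0-\varphi_t^H)^2\,dt\;E\int_0^T\theta_t^2h_t^2\,dt$$

$$\le\mathrm{const}\cdot r^2\Big(E\int_0^T\theta_t^2(\sigma_t^0)^2\,dt+E\int_0^T(\varphi_t^H)^2\,dt\Big)E\int_0^T\theta_t^2\,dt<\infty.\tag{3.32}$$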



Since, further, $DJ(\sigma^0,h;\theta)=0$ for $h=0$, using (3.32) we get

$$0\le\sup_{h\in\mathcal H}DJ(\sigma^0,h;\theta)<\infty.$$
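
Formula (3.31) and the quality of the exponential approximation can be checked in a toy version of this model. The sketch assumes constant (deterministic, time-independent) $\sigma^0$, $\theta$, $\varphi^H$ and $h$, so that $J(\sigma,\theta)=(EH-x)^2+T(\varphi^H-\sigma\theta)^2$; all numerical values and function names are hypothetical.

```python
import math

def J(sigma, theta, phi, gap, T=1.0):
    """J = (EH - x)^2 + T * (phi - sigma * theta)^2 for constant coefficients."""
    return gap ** 2 + T * (phi - sigma * theta) ** 2

def DJ(sigma0, h, theta, phi, T=1.0):
    """Gateaux differential (3.31) specialised to constant coefficients."""
    return 2.0 * T * (theta * sigma0 - phi) * theta * h

# Hypothetical constants: reference volatility, strategy, phi^H, EH - x, radius r.
sigma0, theta, phi, gap, r = 0.4, 2.0, 1.0, 0.5, 1.0

# 1) Central finite difference in delta versus the closed form (3.31).
h, step = 0.7, 1e-6
fd = (J(sigma0 + step * h, theta, phi, gap) - J(sigma0 - step * h, theta, phi, gap)) / (2.0 * step)

# 2) Worst case over constant |h| <= r versus J * exp(delta * sup DJ / J).
def approximation_error(delta):
    # J is convex in h, so the supremum is attained at h = -r or h = +r.
    exact = max(J(sigma0 + delta * s * r, theta, phi, gap) for s in (-1.0, 1.0))
    sup_dj = max(DJ(sigma0, s * r, theta, phi) for s in (-1.0, 1.0))
    j0 = J(sigma0, theta, phi, gap)
    return abs(exact - j0 * math.exp(delta * sup_dj / j0))

err_coarse = approximation_error(0.01)
err_fine = approximation_error(0.001)
```

In this toy setting the gap between the exact worst case and the approximation shrinks roughly like $\delta^2$, which is what "approximation in leading order of $\delta$" suggests.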
Hence we can take $0\le c<\infty$ in problem (3.27). Now if we substitute $\theta^*$ from (3.30) into (3.31), we get $DJ(\sigma^0,h;\theta^*)=0$ for each $h$, and thus

$$\frac{\sup_{h\in\mathcal H}DJ(\sigma^0,h;\theta^*)}{J(\sigma^0,\theta^*)}=0.$$

If we recall that $\theta^*=\arg\min_\theta J(\sigma^0,\theta)$, we get that $\theta^*$ defined by (3.30) is a solution of this optimization problem as well.
Thus we have proved that

(a) the mean-variance robust trading strategy $\theta^*=(\theta_t^*)_{0\le t\le T}$ for the optimization problem (3.25) is given by the formula

$$\theta_t^*=\frac{\varphi_t^H}{\sigma_t^0};$$

(b) at the same time this strategy is an optimal mean-variance robust trading strategy for the optimization problem (3.27).

Hence in this case the suggested approach leads to the perfect solution of the initial problem (3.25).

To solve the problem (3.27) in the general case we need to calculate $DJ(\sigma^0,h,\theta)$.
Suppose that $k=(k_t)_{0\le t\le T}=(k_{i,t},\ 1\le i\le d)_{0\le t\le T}$ from (3.10) is such that $|k_{i,t}|\le\mathrm{const}$, $\forall i,t$.

Following RSch [33] and GLP [11], introduce the probability measure $\tilde Q\sim P$ on $\mathcal F_T$ by the relation

$$d\tilde Q=\frac{\tilde z_T}{\tilde z_0}\,d\tilde P\quad\Big(\text{and hence }d\tilde Q=\frac{\tilde z_T^2}{\tilde z_0}\,dP\Big).\tag{3.33}$$

Using Proposition 5.1 of GLP [TTJ we can write 

T 



J(a, 9) = \H - x - I ' 9' t dR1 

Z Z T \ J 





T 



Z 



where 



^Efi l^-x- U°{a)d^- [ (rt(a))'d^z 

\ Zt J z t J z t 

x 

:=J{a,^ ,^ r ) (or J{a,i>) with ^ = (^ ,^ 1 )'), (3.34) 
$ = V>tV) = j 0' s a s dM° s - 9[a t Ml 0<t<T. 



(3.35) 
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V> t » = ^V) + Sti(h), = ^(a°) + fy°(h). 

Let (following RSch pj) 

T 



H 



H 



z = E[^I )+ l(^)'dU t + L T , 



(3.36) 



be the Galtchouk-Kunita-Watanabe decomposition of r.v. S- z w.r. t. Re- 
valued Q-local martingale U = (?S-, M^-zqY , where ip H = (ip°' H , tp 1 ' 11 )' G 
L 2 (U,Q), the space of F-predictable processes i/j such that / i/i'dU G Ai 2 (Q) 
of martingale, and L G Ail \ oc {Q), L is Q-strongly orthogonal to U. 
Remember that 

^ = (V>W- (3-37) 
Then, using (13.341) . (I3.35P and (13.361) we can write for each h 

J(a° + Sh, if>) = J(<r°, ij)+6- 2J- 1 

T T 



;E®< 



(Mh))'dU t 



o 



T -| 2 

mh)ydu t 

L o 

T T 
0\ „/.-HV, 











(V*(0 - W / WtWYdUt 



(Mh))'dU t 



(3.38) 



Using Proposition 8 of RSch [33] we have for each ft, 



5r 



G r (h,e) 



and hence by (13. 23j) 



(i/)(h))'dU t :ip{h) EL 2 (U,Q) 



T ,2 

E9\ f^ t {h))'dU t 



~2 



£ Q Jj-C4(M) = I EG 2 T (h,6) =zqE[ 9' t dR 
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, T v 2 / T T \ 

z E I y ^MM° J =z e( J 9' t h t d(M) t k t + J 9' t h t dM t 



< const 



e( J \6' t h t d(M) t k t \\ +E(je' t h t dM t 



< const r 2 E / \6 t \ 2 dC t < oo. 



(3.39) 



Further, 



E Q 




T 



<E^i (Ma°) - tf)'dU t ) Eft / (Mh)YdU t < oo. (3.40) 



From these estimates we conclude that: 

r 

1) DJ(a°,h,iP) = 2z^ x Eft J(M<r°) - ^Yd(U) t ipt(h) < oo, 

o 

thanks to (13391 . 

2) DJV°, /i,-0)U=o = 0, since ^(0) = by fl3737j) and fl3T35|) . 
Thus 

sup D J{a°, h, if}) > 0. 
hen 

3) From (EUOl and (13T391 we get 

(DJ(a°, h, ip)f < const z^r 2 

T T 

/ (V>t(*°) - ^f)'d(U) t (^ t {a ) - i)f)E I \9 t \ 2 dC t < oo. 



(3.41) 



(3.42) 



Thus \DJ{a°, h,ip)\ is estimated by the expression which does not depend on 
h, and is equal to zero if we substitute ^(ct°) =ipf,0<t< T. 
Hence, by (E32]) 



< supA7(*°,M)U^ 
hen 



h < sup \DJ(a°,h, 
hen 



I Ip—tp* 







(3.43) 



Further, from (3.42) it follows that we can take $c\in[0,\infty)$ in (3.27).
Now substituting $\psi=\psi^H$ into $J(\sigma^0,\psi)$ and $DJ(\sigma^0,h,\psi)$ we get



J(<r Q ,if> H ) =min l 7(a°,V) = z^ 1 (E p H - xf + z^ 1 E Q L 
(see Lemma 5.1 of GLP [11]) and

$$\sup_{h\in\mathcal H}\frac{DJ(\sigma^0,h,\psi^H)}{J(\sigma^0,\psi^H)}=0.$$

Hence the constraint of problem (3.27) is satisfied.

Remark 3.3. If $x=E^{\tilde P}H$ and $L_T=0$, then we get

$$\frac{DJ(\sigma^0,h,\psi^H)}{J(\sigma^0,\psi^H)}=\frac00,$$

which is assumed to be zero, since if we consider the shifted risk functional $\bar J=J+1$, the optimization problem and the optimal trading strategy will not change, but $D\bar J(\sigma^0,h,\psi^H)=DJ(\sigma^0,h,\psi^H)=0$ and $\bar J(\sigma^0,\psi^H)=1$.

Finally, using Proposition 8 of RSch [33] we arrive at the following

Theorem 3.1. In Model (3.10) under conditions (c.1) and (c.2) the optimal mean-variance robust trading strategy (in the sense of Definition 3.2) is given by the formula

01 = (K°)T W + Ct(V t * - (V-f )'U t )], < t < T, (3.44) 



where 



,H I iO,H il,H\t tt ( z ° 

i>t =m )> u t = — , — z 

x + f (^)'dU t 






ipf and Ct are given by the relations ( Iff. 3ty and (Iff, lip , respectively, ~z t is defined 
in (dJZD- 